Abstract

A wealth of astronomical data indicate the presence of mass discrepancies in the Universe. 
The motions observed in a variety of classes of extragalactic systems exceed what can be 
explained by the mass visible in stars and gas. Either (i) there is a vast amount of unseen mass 
in some novel form - dark matter - or (ii) the data indicate a breakdown of our understanding 
of dynamics on the relevant scales, or (iii) both. Here, we first review a few outstanding 
challenges for the dark matter interpretation of mass discrepancies in galaxies, purely based 
on observations and independently of any alternative theoretical framework. We then show 
that many of these puzzling observations are predicted by one single relation - Milgrom's law
- involving an acceleration constant $a_0$ (or a characteristic surface density $\Sigma_\dagger = a_0/G$) of the
order of the square-root of the cosmological constant in natural units. This relation can at
present most easily be interpreted as the effect of a single universal force law resulting from a 
modification of Newtonian dynamics (MOND) on galactic scales. We exhaustively review the 
current observational successes and problems of this alternative paradigm at all astrophysical 
scales, and summarize the various theoretical attempts (TeVeS, GEA, BIMOND, and others) 
made to effectively embed this modification of Newtonian dynamics within a relativistic theory 
of gravity. 
1 Introduction



Two of the most tantalizing mysteries of modern astrophysics are known as the dark matter and 
dark energy problems. These problems come from the discrepancies between, on one side, the 
observations of galactic and extragalactic systems (as well as the observable Universe itself in the 
case of dark energy) by astronomical means, and on the other side, the predictions of General Rel- 
ativity from the observed amount of matter-energy in these systems. In short, what astronomical 
observations are telling us is that the dynamics of galactic and extragalactic systems as well as 
the expansion of the Universe itself do not correspond to the observed mass-energy as they should 
if our understanding of gravity is complete. This thus indicates either (i) the presence of unseen 
(and yet unknown) mass-energy, or (ii) a failure of our theory of gravity, or (iii) both. 

The third case is a priori the most plausible, as there are good reasons for there being more 
particles than those of the standard model of particle physics [258] (actually, even in the case of
baryons, we suspect that a lot of them have not been seen yet and thus literally make up unseen
mass, in the form of "missing baryons"), and as there is a priori no reason that General Relativity
should be valid over a wide range of scales where it has never been tested [46], and where the
need for a dark sector actually prevents the theory from being tested until this sector has been
detected by other means than gravity itself^1. However, either of the first two cases could be the
dominant explanation of the discrepancies in a given class of astronomical systems (or even in all 
astronomical systems), and this is actually testable. 

For instance, as far as (ii) is concerned, if the mass discrepancies in a class of systems are mostly 
caused by some subtle change in gravitational physics, then there should be a clear signature of 
a single, universal force law at work in this whole class of systems. If instead there is a distinct 
dark matter component in these, the kinematics of any given system should then depend on the 
particular distribution of both dark and luminous mass. This distribution would vary from system 
to system, depending on their environment and past history of formation, and should in principle 
not result in anything like an apparent universal force law^2.

Over the years, there have been a large variety of such attempts to alter the theory of gravity 
in order to remove the need for dark matter and/or dark energy. In the case of dark energy, there 
is some wiggle room, but in the case of dark matter, most of these alternative gravity attempts 
fail very quickly, and for a simple reason: once a force law is specified, it must fit all relevant 
kinematic data in a given class of systems, with the mass distribution specified by the visible 
matter only. This is a tall order with essentially zero wiggle room: at most one particular force 
law can work. However, among all these attempts, there is one survivor: the Modified Newtonian 
Dynamics (MOND) hypothesized by Milgrom almost 30 years ago [295, 296, 294] seems to come
close to satisfying the criterion of a universal force law in a whole class of systems, namely galaxies. 
This success implies a unique relationship between the distribution of baryons and the gravitational 
field in galaxies and is extremely hard to understand within the present dominant paradigm of the 
concordance cosmological model, hypothesizing that General Relativity is correct on every relevant 
scale in cosmology including galactic scales, and that the dark sector in galaxies is made of non- 
baryonic dissipationless and collisionless particles. Even if such particles are detected directly in 
the near to far future, the success of MOND on galaxy scales as a phenomenological law, as well as 
the associated appearance of a universal critical acceleration constant $a_0 \sim 10^{-10}\,\mathrm{m\,s^{-2}}$ in various
seemingly unrelated aspects of galaxy dynamics, will anyway have to be explained and understood
by any successful model of galaxy formation and evolution. Previous reviews of various aspects
of MOND, at an observational and theoretical level, can be found in [35, 82, 279, 399, 408, 430].
A blog-website dedicated to this topic is also maintained, with all the relevant
literature as well as introductory level articles [264] (see also [239]).

^1 Up to now, all the dark matter particle candidates still elude both direct and indirect non-gravitational detection.

^2 However, a way to effectively reproduce an apparent universal force law from an exotic dark component could
be to enforce an intimate connection between the distribution of baryons, the dark component, and the gravitational
field through, e.g., a fifth force effect. This possibility will be extensively discussed in Section 7, notably Section 7.9.

Here, we first review the basics of the dark matter problem (Section 2) as well as the basic
ingredients of the present-day concordance model of cosmology (Section 3). We then point out a
few outstanding challenges for this model (Section 4), both from the point of view of unobserved
predictions of the model, and from the point of view of unpredicted observations (all uncannily
involving a common acceleration constant $a_0$). Up to that point, the challenges presented are purely
based on observations, and are fully independent of any alternative theoretical framework^3. We
then show that, surprisingly, many of these puzzling observations can be summarized within one
single empirical law, Milgrom's law (Section 5), which can be most easily (although not necessarily
uniquely) interpreted as the effect of a single universal force law resulting from a modification of
Newtonian dynamics (MOND) in the weak-acceleration regime $a < a_0$, for which we present the
current observational successes and problems (Section 6). We then summarize the various attempts
currently made to embed this modification in a generally covariant relativistic theory of gravity
(Section 7) and how such theories allow new predictions on gravitational lensing (Section 8) and
cosmology (Section 9). We finally draw conclusions in Section 10.



^3 The first four sections provide the observational evidence for the MOND phenomenology through the different
appearances of $a_0$ in galactic dynamics, but they are actually independent of any specific theory, while the reader
more specifically interested in MOND per se could go directly to Section 5.
2 The Missing Mass Problem in a Nutshell



There exists overwhelming evidence for mass discrepancies in the Universe from multiple indepen- 
dent observations. This evidence involves the dynamics of extragalactic systems: the motions of 
stars and gas in galaxies and clusters of galaxies. Further evidence is provided by gravitational 
lensing, the temperature of hot, X-ray emitting gas in clusters of galaxies, the large scale structure
of the Universe, and the gravitating mass density of the Universe itself (Figure 1). For an
exhaustive historical review of the problem, we refer the reader to [394].

The data leave no doubt that when the law of gravity as currently known is applied to extragalactic
systems, it fails if only the observed stars and gas are included as sources in the stress-energy
tensor. This leads to a stark choice: either the Universe is pervaded by some unseen form of
mass - dark matter - or the dynamical laws that lead to this inference require revision. Though the
mass discrepancy problem is now well established [466], such a dramatic assertion warrants a
brief review of the evidence.

Historically, the first indications of the modern missing mass problem came in the 1930s shortly 
after galaxies were recognized to be extragalactic in nature. Oort [343] noted that the sum of the
observed stars in the vicinity of the sun fell short of explaining the vertical motions of stars in 
the disk of the Milky Way. The luminous matter did not provide a sufficient restoring force for 
the observed stellar vertical oscillations. This became known as the Oort discrepancy. Around the 
same time, Zwicky [519] reported that the velocity dispersion of galaxies in clusters of galaxies was 
far too high for these objects to remain bound for a substantial fraction of cosmic time. The Oort 
discrepancy was approximately a factor of two in amplitude, and confined to the Galactic disk - it 
required local dark matter, not necessarily the quasi-spherical halo we now envision. It was long 
considered a serious problem, but has now largely (though perhaps not fully) gone away [195, 241].
The discrepancy Zwicky reported was less subtle, as the required dark mass outweighed the visible 
stars by a factor of at least 100. This result was apparently not taken seriously at the time. 

One of the first indications of the need for dark matter in modern times came from the stability 
of galactic disks. Stars in spiral galaxies like the Milky Way are predominantly on approximately 
circular orbits, with relatively few on highly eccentric orbits [133]. The small velocity dispersion
of stars relative to their circular velocities makes galactic disks dynamically cold. Early simula- 
tions [344] revealed that cold, self-gravitating disks were subject to severe instabilities. In order to 
prevent the rapid, self-destructive growth of these instabilities, and hence preserve the existence 
of spiral galaxies over a sizable fraction of a Hubble time, it was found to be necessary to embed 
the disk in a quasi-spherical potential well - a role that could be played by a halo of dark matter, 
as first proposed in 1973 by Ostriker & Peebles [344].

Perhaps the most persuasive piece of evidence was then provided, notably through the seminal
works of Bosma and Rubin, by establishing that the rotation curves of spiral galaxies are approximately
flat [68, 371]. A system obeying Newton's law of gravity should have a rotation curve
that, like the Solar system, declines in a Keplerian manner once the bulk of the mass is enclosed:
$V_c \propto r^{-1/2}$. Instead, observations indicated that spiral galaxy rotation curves tended to remain
approximately flat with increasing radius: $V_c \sim$ constant. This was shown to happen over and over
and over again [371], with the approximate flatness of the rotation curve persisting to the largest
radii observable [68], well beyond where the details of each galaxy's mass distribution mattered, so
that Keplerian behavior should have been observed. Again, a quasi-spherical halo of dark matter
as proposed by Ostriker and Peebles was implicated.
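
The contrast between a Keplerian decline and a flat rotation curve can be made concrete with a few lines of arithmetic. The sketch below is only illustrative: the luminous mass, the radii, and the function names are our own hypothetical choices, and the galaxy is idealized as a point mass once the bulk of the mass is enclosed.

```python
import math

# Gravitational constant in kpc (km/s)^2 / Msun (standard value).
G = 4.30091e-6

def keplerian_speed(mass_msun, radius_kpc):
    """Circular speed (km/s) outside a mass M: V_c = sqrt(G M / r)."""
    return math.sqrt(G * mass_msun / radius_kpc)

def newtonian_mass_for_speed(speed_kms, radius_kpc):
    """Enclosed mass (Msun) that Newtonian gravity needs for V_c at radius r."""
    return speed_kms**2 * radius_kpc / G

# Hypothetical galaxy: all luminous mass is enclosed within the first radius.
luminous_mass = 1.0e11        # Msun, illustrative only
radii = [10.0, 20.0, 40.0]    # kpc, illustrative only

# Expected Newtonian speeds from the luminous mass alone.
kepler = [keplerian_speed(luminous_mass, r) for r in radii]

# Suppose instead the observed curve stays flat at the innermost value.
v_flat = kepler[0]
needed = [newtonian_mass_for_speed(v_flat, r) for r in radii]

for r, v, m in zip(radii, kepler, needed):
    print(f"r = {r:5.1f} kpc  Keplerian V_c = {v:6.1f} km/s  "
          f"mass for flat curve = {m / luminous_mass:4.1f} x luminous")
```

In this idealization the Keplerian speed halves when the radius quadruples, whereas keeping the curve flat requires the enclosed Newtonian mass to grow linearly with radius, which is the behavior attributed to the halo.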

Figure 1: Summary of the empirical roots of the missing mass problem (below line) and the generic
possibilities for its solution (above line). Illustrated lines of evidence include the approximate
flatness of the rotation curves of spiral galaxies, gravitational lensing in a cluster of galaxies, and
the growth of large scale structure from an initially very nearly homogeneous early Universe. Other
historically important lines of evidence include the Oort discrepancy, the need to stabilize galactic
disks, motions of galaxies within clusters of galaxies and the hydrodynamics of hot, X-ray emitting
gas therein, and the apparent excess of gravitating mass density over the mass density of baryons
permitted by big bang nucleosynthesis. From these many distinct problems grow several possible
solutions. Generically, the observed discrepancies either imply the existence of dark matter, or
the necessity to modify dynamical laws. Dark matter could in principle be any combination of
non-luminous baryons and/or some non-baryonic form of mass like neutrinos (hot dark matter)
or some new particle whose mass makes it dynamically cold or perhaps warm. Alternatively, the
observed discrepancies might point to the need to modify the equation of gravity that is employed
to infer the existence of dark matter, or perhaps some other fundamental dynamical assumption
like the equivalence of inertial mass and gravitational charge. Many specific ideas of each of these
types have been considered over the years. Note that none of these ideas are mutually exclusive,
and that some form or the other of dark matter could happily cohabit with a modification of the
gravitational law, or could even be itself the cause of an effective modification of the gravitational
law. Question marks on some tree branches represent the fruit of ideas yet to be had. Perhaps
these might also address the dark energy problem, with the most satisfactory result being a theory
that would simultaneously explain the acceleration scale in the dark matter problem as well as the
accelerating expansion of the Universe, and explain the coincidence of scales between these two
problems, a coincidence exhibited in Sect. 4.1.

Other types of galaxies exhibit mass discrepancies as well. Perhaps most notable are the dwarf
spheroidal galaxies that are satellites of the Milky Way [428, 478] and of Andromeda [218]. These
satellites are tiny by galaxy standards, possessing only millions, or in the case of the so-called
ultrafaint dwarfs, thousands, of individual stars. They are close enough that the line-of-sight
velocities of individual stars can be measured, providing for a precise measurement of the system's
velocity dispersion. The mass inferred from these motions (roughly, $M \approx r\sigma^2/G$) greatly exceeds
the mass visible in luminous stars. Indeed, these dim satellite galaxies exhibit some of the largest
mass discrepancies observed. In contrast, bright giant elliptical galaxies (often composed of much
more than the $\sim 10^{11}$ stars of the Milky Way) exhibit remarkably modest and hard to detect mass
discrepancies [368]. It is thus inferred that fainter galaxies are progressively more dark matter
dominated than bright ones. However, as we shall expand on in Sect. 4.3, the primary correlation
is not with luminosity, but with surface brightness: the lower the surface brightness of a system,
the larger its mass discrepancy [279].
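
The rough estimator quoted above, $M \approx r\sigma^2/G$, is simple enough to evaluate directly. The following is a minimal sketch with made-up inputs: the radius, velocity dispersion, and stellar mass are hypothetical numbers chosen to resemble a small satellite, not measurements of any particular galaxy, and the order-unity geometric prefactor of a proper virial analysis is ignored.

```python
# Gravitational constant in pc (km/s)^2 / Msun (standard value).
G = 4.30091e-3

def dynamical_mass(radius_pc, sigma_kms):
    """Order-of-magnitude dynamical mass (Msun): M ~ r sigma^2 / G."""
    return radius_pc * sigma_kms**2 / G

def mass_discrepancy(radius_pc, sigma_kms, stellar_mass_msun):
    """Ratio of the dynamical mass to the mass visible in stars."""
    return dynamical_mass(radius_pc, sigma_kms) / stellar_mass_msun

# Hypothetical dwarf spheroidal (illustrative values only).
radius = 300.0          # pc
sigma = 10.0            # km/s, line-of-sight velocity dispersion
stellar_mass = 3.0e5    # Msun, assuming roughly one Msun per Lsun

m_dyn = dynamical_mass(radius, sigma)
ratio = mass_discrepancy(radius, sigma, stellar_mass)
print(f"M_dyn ~ {m_dyn:.2e} Msun, discrepancy ~ {ratio:.0f}")
```

With these illustrative inputs the dynamical mass is several million Solar masses, more than twenty times the assumed stellar mass. Because the estimate scales as $\sigma^2$, the inferred discrepancy is only as good as the measured dispersion.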

On larger scales, groups and clusters of galaxies also show mass discrepancies, just as individual
galaxies do. One of the earliest lines of evidence comes from the so-called "timing argument"
in the Local Group [214]. Presumably the material that was to become the Milky Way and
Andromeda (M31) was initially expanding apart with the general Hubble expansion. Currently
they are approaching one another at $\sim 100\,\mathrm{km\,s^{-1}}$. In order for the Milky Way and M31 to
have overcome the initial expansion and fallen back towards one another, there must be a greater
than average gravitating mass between the two. To arrive at their present separation with the
observed blueshifted line of sight velocity after a Hubble time requires a dynamical mass-to-light
ratio $M/L > 80$. This greatly exceeds the mass-to-light ratio of the stars themselves, which is of
order unity in Solar units [43] (the Sun is a fairly average star, so averaged over many stars each
Solar mass produces roughly one Solar luminosity).
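
The timing argument can be sketched numerically by treating the Milky Way and M31 as two point masses on a purely radial Kepler orbit that started at zero separation at the big bang. This is our own minimal illustration rather than the calculation of the cited work: the separation (780 kpc) and the age (13.7 Gyr) are assumed values not given in the text, only the approach speed of 100 km/s is taken from it, and the function names are hypothetical. The parametric solution is $r = a(1-\cos\eta)$, $t = \sqrt{a^3/GM}\,(\eta - \sin\eta)$, so that $vt/r = \sin\eta\,(\eta-\sin\eta)/(1-\cos\eta)^2$.

```python
import math

G = 4.30091e-6             # kpc (km/s)^2 / Msun (standard value)
KPC_PER_KMS_GYR = 1.0227   # distance in kpc covered at 1 km/s in 1 Gyr

def timing_phase(v_kms, r_kpc, t_gyr, tol=1e-12):
    """Solve v t / r = sin(eta)(eta - sin eta)/(1 - cos eta)^2 by bisection.

    Only the infalling branch pi < eta < 2 pi is searched, so v must be
    negative (the two galaxies are approaching each other).
    """
    target = v_kms * t_gyr * KPC_PER_KMS_GYR / r_kpc

    def g(eta):
        s, c = math.sin(eta), math.cos(eta)
        return s * (eta - s) / (1.0 - c) ** 2 - target

    lo, hi = math.pi, 2.0 * math.pi - 1e-3   # g(lo) > 0 > g(hi) for v < 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def timing_mass(v_kms, r_kpc, t_gyr):
    """Total mass (Msun) of the pair implied by the radial Kepler orbit."""
    eta = timing_phase(v_kms, r_kpc, t_gyr)
    a = r_kpc / (1.0 - math.cos(eta))        # semi-major axis, kpc
    t = t_gyr * KPC_PER_KMS_GYR              # time in kpc / (km/s)
    return a**3 * (eta - math.sin(eta)) ** 2 / (G * t**2)

# Approach speed from the text; separation and age are assumptions.
eta = timing_phase(-100.0, 780.0, 13.7)
mass = timing_mass(-100.0, 780.0, 13.7)
print(f"eta = {eta:.3f} rad, M ~ {mass:.1e} Msun")
```

Under these assumptions the implied total mass is a few times $10^{12}$ Solar masses. Dividing by the combined luminosity of the two galaxies, which is not computed here, is what yields the large mass-to-light ratio quoted above.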

Rich clusters of galaxies are rare structures containing dozens or even hundreds of bright galaxies.
These objects exhibit mass discrepancies in several distinct ways. Measurements of the redshifts
of individual cluster members give velocity dispersions in the vicinity of 1,000 $\mathrm{km\,s^{-1}}$, typically
implying dynamical mass-to-light ratios in excess of 100 [21]. The actual mass discrepancy
is not this large, as most of the detected baryonic mass in clusters is in a diffuse intracluster gas
rather than in the stars in the galaxies (something Zwicky was not aware of back in 1933). This
gas is heated to the virial temperature and emits X-rays. Mapping the temperature and emission
of this X-ray gas provides another probe of the cluster mass through the equation of hydrostatic
equilibrium. Holding the gas in the clusters at the observed temperatures requires
dark matter that outweighs the gas by a factor of $\sim 8$ [176]. Furthermore, some clusters are observed
to gravitationally lens background galaxies (Figure 1). Once again, mass above and beyond
that observed is required to explain this phenomenon [228]. Thus three independent methods all
imply the need for about the same amount of dark matter in clusters of galaxies.

In addition to the abundant evidence for mass discrepancies in the dynamics of extragalactic
systems, there are also strong motivations for dark matter in cosmology. Two observations are
particularly important: (i) the small baryonic mass density $\Omega_b$ inferred from big bang nucleosynthesis
(and from the measured Hubble parameter), and (ii) the growth of large scale structure
by a factor of $\sim 10^5$ from the surface of last scattering of the cosmic microwave background at
redshift $z \sim 1000$ until present-day $z = 0$, implying $\Omega_m > \Omega_b$. Together, these observations thus
imply not only the need for dark matter, but for some exotic new form of non-baryonic cold dark
matter. Indeed, observational estimates of the gravitating mass density of the Universe $\Omega_m$, measured
for instance from peculiar galaxy (or large-scale) velocity fields, have for several decades
persistently returned values in the range $1/4 < \Omega_m < 1/3$ [117]. While shy of the value needed
for a flat Universe, this mass density is well in excess of the baryon density inferred from big bang
nucleosynthesis (BBN). The observed abundances of the light isotopes deuterium, helium, and
lithium are consistent with having been produced in the first few minutes after the big bang if the
baryon density is just a few percent of the critical value: $\Omega_b < 0.05$ [481, 108]. Thus $\Omega_m > \Omega_b$.
Consequently, we don't just need dark matter, we need the dark matter to be non-baryonic.

Another early Universe constraint is provided by the Cosmic Microwave Background (CMB). 
The small (microKelvin) amplitude of the temperature fluctuations at the time of baryon-photon 
decoupling ($z \approx 1000$) indicates that the Universe was initially very homogeneous, roughly to one
part in $10^5$. The Universe today ($z = 0$) is very inhomogeneous, at least on "small" scales of
less than $\sim 100$ Mpc ($\approx 3 \times 10^8$ light-years), with huge density contrasts between planets, stars,
galaxies, clusters, and empty intergalactic space. The only attractive long-range force acting on 
the entire Universe, that can make such structures, is gravity. In a rich-get-richer and
poor-get-poorer process, the small initial over-densities attract more mass and grow into structures like
galaxies while under-dense regions become less dense, leading to voids. The catch is that gravity
is rather weak, so this process takes a long time. If the baryon density from BBN is all we have
to work with, we can only obtain a growth factor of $\sim 10^3$ in a Hubble time [425], two orders of
magnitude short of the observed $10^5$. The solution is to boost the growth rate with extra invisible
mass displaying larger density fluctuations: dark matter. In order not to make the same mark on 
the CMB that baryons would, this dark matter must not interact with the photons. So, in effect, 
the density fluctuations in the dark matter can already be very large at the epoch of baryon- 
photon decoupling, especially if the dark matter is cold (i.e., with effectively zero Jeans length). 
The baryons fall into the already deep dark matter potential wells only after that, once released 
from their electromagnetic link to the photon bath. Before decoupling, the fluctuations in the 
baryon-photon fluid did not grow but were oscillating in the form of acoustic waves, maintaining 
the same amplitude as when they entered the horizon; actually they were even slightly diffusion- 
damped. In principle, at baryon-photon decoupling, CMB fluctuations on smaller angular scales, 
having entered the horizon earlier, would thus have been damped with respect to those on larger 
scales (Silk damping). Nevertheless, the presence of decoupled non-baryonic dark matter would 
provide a net forcing term countering the damping of the oscillations at recombination, meaning 
that the second and third acoustic peaks of the CMB could then be of equal amplitude rather than 
exhibiting a damping tail. The actual observation of a high third peak in the CMB angular power
spectrum is thus another compelling piece of evidence for non-baryonic dark matter (see, e.g., [230]). Both
BBN and the CMB thus drive us to consider a form of mass that is non-baryonic and which does 
not interact electromagnetically. Moreover, in order to form structure (see Sect. 3.2), the mass 
must be dynamically cold (i.e., moving much slower than the speed of light when it decouples from 
the photon bath), and is known as cold dark matter (CDM). 
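
The growth shortfall described in this paragraph follows from a one-line scaling. As a rough sketch, we assume that a linear density contrast in a matter-dominated (Einstein-de Sitter) universe grows in proportion to the scale factor, $\delta \propto 1/(1+z)$; this simplification and the function name are ours, and it ignores the late-time slowdown of growth caused by dark energy.

```python
import math

def linear_growth_factor(z_start, z_end=0.0):
    """Growth of a linear density contrast between two redshifts,
    assuming delta grows like the scale factor a = 1 / (1 + z)."""
    return (1.0 + z_start) / (1.0 + z_end)

initial_contrast = 1.0e-5    # baryon fluctuations at decoupling, ~1 part in 10^5
required_growth = 1.0e5      # growth needed to reach order-unity contrasts today

growth = linear_growth_factor(1000.0)             # from z ~ 1000 to z = 0
contrast_today = initial_contrast * growth        # still far below unity
shortfall_orders = math.log10(required_growth / growth)

print(f"growth ~ {growth:.0f}, contrast today ~ {contrast_today:.0e}, "
      f"shortfall ~ {shortfall_orders:.1f} orders of magnitude")
```

Starting from baryon-only fluctuations of one part in $10^5$, growth by a factor of about a thousand leaves contrasts at the percent level, roughly two orders of magnitude short of nonlinear structure. Fluctuations in a decoupled dark component that are already larger at $z \sim 1000$ close this gap.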

Now, in addition to CDM, modern cosmology also requires something even more mysterious,
dubbed dark energy. The fact that the baryon fraction in clusters of galaxies was such that $\Omega_m$
was implied to be much smaller than 1 (the value needed for a flat Euclidean Universe favored by
inflationary models), as well as tensions between the measured Hubble parameter and independent
estimates of the age of the Universe, led Ostriker & Steinhardt [345] to propose in 1995 a so-called
"concordance model of cosmology" or ΛCDM model, where a cosmological constant $\Lambda$ - supposed to
represent vacuum energy or dark energy - provided the major contribution to the Universe's energy
density. Three years later, the observations of Type Ia supernovae (SNe Ia) [352, 366], indicating late-time
acceleration of the Universe's expansion, led most people to accept this model. This concordance model has since
been refined and calibrated through subsequent large-scale observations of the CMB and of the
matter power spectrum, to lead to the favored cosmological model prevailing today (see Sect. 3).
However, as we shall see, curious coincidences of scales between the dark matter and dark energy
sectors (see Sect. 4.1) have prompted the question of whether these two sectors are really physically
independent, and the existence of dark energy itself has led to a renewed interest in modified gravity
theories as a possible alternative to this exotic fluid [101].
3 A Brief Overview of the ΛCDM Cosmological Model



General relativity provides a clear and compelling cosmology, the Friedmann-Lemaître-Robertson-Walker
(FLRW) model. The expansion of the Universe discovered by Hubble and Slipher found a
natural explanation^4 in this context. The picture of a hot big bang cosmology that emerged from
this model famously predicted the existence of the 3 degree CMB and the abundances of the light 
isotopes via BBN. 

Within the FLRW framework, we are inexorably driven to infer the existence of both non-baryonic
cold dark matter and a non-zero cosmological constant, as discussed in Sect. 2. The
resulting concordance ΛCDM model - first proposed in 1995 by Ostriker and Steinhardt [345] - is
encouraged by a wealth of observations: the consistency of the Hubble parameter with the ages of
the oldest stars [345], the consistency between the dynamical mass density of the Universe, that of
baryons from BBN (see also discussion in Sect. 9.2), and the baryon fraction of clusters [487], as
well as the power spectrum of density perturbations [104, 453]. A prediction of the concordance
model is that the expansion rate of the Universe should be accelerating; this was confirmed by
observations of high redshift Type Ia supernovae [352, 366]. Another successful prediction was the
scale of the baryonic acoustic oscillation [135]. Perhaps the most emphatic support for ΛCDM
comes from fits to the acoustic power spectrum of temperature fluctuations in the CMB [230].

For a brief review of the basics and successes of the concordance cosmological model we refer 
the reader to, e.g., [55, 351] and all references therein. We note that, while most of the cosmological
probes in the above list are not uniquely fit by the ΛCDM model on their own, when they are taken
together they provide a remarkably tight set of constraints. The success of this now favoured 
cosmological model on large scales is thus remarkable indeed, as there was a priori no reason that 
such a parameterized cosmology could explain all these completely independent data sets with such 
outstanding consistency. 

In this model, the Hubble constant is $H_0 = 70\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$ (i.e., $h = 0.7$), the amplitude
of density fluctuations within a top-hat sphere of $8h^{-1}$ Mpc is $\sigma_8 = 0.8$, the optical depth to
reionization is $\tau = 0.08$, the spectral index measuring how fluctuations change with scale is $n_s = 0.97$,
and the price we pay for the outstanding success of the model is new physics in the form of
a dark sector. This dark sector makes up 95% of the mass-energy content of the Universe in
ΛCDM: it is composed separately of a dark energy sector and a cold dark matter sector, which we
briefly describe below.

3.1 Dark Energy (Λ)

In ΛCDM, dark energy is a non-vanishing vacuum energy represented by the cosmological constant
$\Lambda$ in the field equations of General Relativity. Einstein's cosmological constant is equivalent to
vacuum energy with equation of state $p/\rho = w = -1$. In principle, the equation of state could
be merely close to, but not exactly, $w = -1$. In this case, the dark energy could evolve and
clump, depending on the value of $w$ and its evolution $\dot{w}$. However, to date, there is no compelling
observational reason to require any form of dark energy more complex than the simple cosmological
constant introduced by Einstein.

^4 Arguably a non-static, expanding or contracting Universe was an a priori prediction of General Relativity in
its original form lacking the cosmological constant.

The various observational datasets discussed above constrain the ratio of the dark energy density
to the critical density to be $\Omega_\Lambda = \Lambda/(3H_0^2) = 0.73$, where $H_0$ is Hubble's constant and $\Lambda$ is expressed
in $\mathrm{s^{-2}}$. This value, together with the matter density $\Omega_m$ (see below), leads to a total $\Omega = \Omega_\Lambda + \Omega_m = 1$,
i.e., a spatially flat Euclidean geometry in the Robertson-Walker sense that is nicely consistent
with the expectations of inflation. It is important to stress that this model relies on the cosmological
principle, i.e., that our observational location in the Universe is not special, and on the fact
that on large scales, the Universe is isotropic and homogeneous. For possible challenges to these
assumptions and their consequences, we refer the reader to, e.g., [488, 489].
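
The relation $\Omega_\Lambda = \Lambda/(3H_0^2)$ also allows a quick check of the remark in the abstract that $a_0$ is of the order of the square root of the cosmological constant in natural units. The sketch below restores the factors of $c$ and compares $c\sqrt{\Lambda/3}$ with $a_0$. The value $a_0 = 1.2 \times 10^{-10}\,\mathrm{m\,s^{-2}}$ is an assumption on our part (the text above only gives the order of magnitude), and the function names are hypothetical.

```python
import math

C_KMS = 299_792.458      # speed of light, km/s
KM_PER_MPC = 3.0857e19   # kilometres in one megaparsec

def hubble_rate_per_second(h0_kms_mpc):
    """Convert H0 from km/s/Mpc to s^-1."""
    return h0_kms_mpc / KM_PER_MPC

def cosmological_constant(h0_kms_mpc, omega_lambda):
    """Lambda in s^-2, from Omega_Lambda = Lambda / (3 H0^2)."""
    h0 = hubble_rate_per_second(h0_kms_mpc)
    return 3.0 * omega_lambda * h0**2

def lambda_acceleration(h0_kms_mpc, omega_lambda):
    """Acceleration scale c * sqrt(Lambda / 3), in m/s^2."""
    lam = cosmological_constant(h0_kms_mpc, omega_lambda)
    return C_KMS * 1.0e3 * math.sqrt(lam / 3.0)

# Concordance values quoted in the text.
lam = cosmological_constant(70.0, 0.73)
a_lambda = lambda_acceleration(70.0, 0.73)

a0 = 1.2e-10             # m/s^2, assumed value of Milgrom's constant
ratio = a_lambda / a0

print(f"Lambda ~ {lam:.2e} s^-2, c*sqrt(Lambda/3) ~ {a_lambda:.1e} m/s^2, "
      f"ratio to a0 ~ {ratio:.1f}")
```

With these inputs the two accelerations agree to within a factor of a few, which is all that "of the order of" claims. The numerical coincidence by itself says nothing about a physical connection.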

3.2 Cold Dark Matter (CDM) 

In ΛCDM, dark matter is assumed to be made of non-baryonic dissipationless massive particles [49],
the so-called "cold dark matter" (CDM). This dark matter outweighs the baryons that participate
in BBN by about 5:1. The density of baryons from the CMB is $\Omega_b = 0.046$, roughly consistent with
BBN [230]. This is a small fraction of the critical density; with the non-baryonic dark matter the
total matter density is $\Omega_m = \Omega_{\rm CDM} + \Omega_b = 0.27$.
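
The density budget quoted in this section can be cross-checked with simple arithmetic. The sketch below uses only the values stated in the text ($\Omega_m = 0.27$, $\Omega_b = 0.046$, $\Omega_\Lambda = 0.73$); the function name is our own.

```python
def dark_sector_budget(omega_m, omega_b, omega_lambda):
    """Return (Omega_CDM, CDM-to-baryon ratio, dark fraction of the total)."""
    omega_cdm = omega_m - omega_b
    cdm_to_baryon = omega_cdm / omega_b
    dark_fraction = (omega_cdm + omega_lambda) / (omega_m + omega_lambda)
    return omega_cdm, cdm_to_baryon, dark_fraction

omega_cdm, cdm_to_baryon, dark_fraction = dark_sector_budget(0.27, 0.046, 0.73)
print(f"Omega_CDM = {omega_cdm:.3f}, CDM : baryons = {cdm_to_baryon:.1f} : 1, "
      f"dark sector = {100.0 * dark_fraction:.0f}% of the total")
```

The result reproduces both the "about 5:1" ratio of cold dark matter to baryons and the statement that the dark sector makes up roughly 95% of the mass-energy content.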

The "cold" in cold dark matter means that CDM moves slowly so that it is non-relativistic when 
it decouples from photons. This allows it to condense and begin to form structure while the baryons 
are still electromagnetically coupled to the photon fluid. After recombination, when protons and 
electrons first combine to form neutral atoms so that the cross-section for interaction with the 
photon bath suddenly drops, the baryons can fall into the potential wells already established by 
the dark matter, leading to a hierarchical scenario of structure formation with the repeated merger 
of smaller CDM clumps to form ever larger clumps. 

Particle candidates for the CDM must be massive, non-baryonic, and immune to electromagnetic
interactions. The currently preferred CDM candidates are Weakly Interacting Massive Particles
(WIMPs [47, 48, 49]) that condensed from the thermal bath of the early Universe. These
should have masses on the order of about 100 GeV so that (i) the free-streaming length is small
enough to create small-scale structures as observed (e.g., dwarf galaxies), and (ii) thermal
relics with cross-sections typical for weak nuclear reactions account for the right amount of matter
density $\Omega_m$ (see, e.g., Eq. 28 of [49]). This last point is known as the WIMP miracle^5.

For lighter particle candidates (e.g., ordinary neutrinos or light sterile neutrinos), the damping 
scale becomes too large. For instance, a hot dark matter (HDM) particle candidate with mass of 
a few to 15 eV would have a free-streaming length of about 100 Mpc, leading to too little power
at the small-scale end of the matter power spectrum. The existence of galaxies at redshift z ~ 6 
implies that the coherence length should have been smaller than 100 kpc or so, meaning that even 
warm dark matter (WDM) particles with masses between 1 and 10 keV are close to being ruled 
out as well (see, e.g., [349]). ΛCDM thus presently remains the state-of-the-art in Cosmology,
although some of the challenges listed below in Sect. 4 are leading to a slow drift of the standard
concordance model from CDM to WDM [253]. This drift, however, brings along its own problems and
fails to address most of the current observational challenges summarized in the following section,
which might thus perhaps point to a more radical alternative to the model.



^5 The WIMP miracle seems however to fade away with modern particle physics constraints [23].
4 Some Challenges for the ΛCDM Model



The great concordance of independent cosmological observables from Gpc to Mpc scales lends a 
certain air of inevitability to the ΛCDM model. If we accept these observables as sufficient to
prove the model, then any discrepancy appears as trivia that will inevitably be explained away.
If instead we require a higher standard, such as positive laboratory evidence for the dark sectors,
then ΛCDM appears as a yet unproven hypothesis that relies heavily on two potentially fictitious
invisible entities. An important test of ΛCDM as a scientific hypothesis is thus the existence of
dark matter. By this we mean not just unseen mass, but specifically CDM: some novel form 
of particle with the right microscopic properties and correct cosmic mass density. Searches for 
WIMPs are now rather mature and not particularly encouraging. Direct detection experiments 
have as yet no positive detections, and have now excluded [19] the bulk of the parameter space
(interaction cross-section and particle mass) where WIMPs were expected to reside. Indirect
detection through the observation of γ-rays produced by the self-annihilation of WIMPs in the
Galactic halo and in nearby satellite galaxies has similarly returned null results (SJ [53 1173] at
interestingly restrictive levels. For the most plausible minimally supersymmetric models, particle
colliders should already have produced evidence for WIMPs [U [21 [23]. The right model need not
be minimal. It is always possible to construct a more complicated model that manages to evade all 
experimental constraints. Indeed, it is readily possible to imagine dark matter candidates that do 
not interact at all with the rest of the Universe except through gravity. Though logically possible, 
such dark matter candidates are profoundly unsatisfactory in that they could not be detected in 
the laboratory: their hypothesized existence could neither be confirmed nor falsified. 

Apart from this current non-detection of CDM candidates, there also exist prominent observational
challenges for the ΛCDM model, which might point towards the necessity of an alternative
model (or, at the very least, an improved one). These challenges are that (i) some of the parameters
of the model appear fine-tuned (Sect. 4.1); (ii) many of the predictions made for galaxy formation
and evolution were not successful (Sect. 4.2), although these processes mainly happen on kpc
scales, where predictions are more difficult to make because baryon physics should play a more
prominent role; and (iii) a number of observations on these galactic scales exhibit regularities
that are fully unexpected in any CDM context without a substantial amount of fine-tuning in
terms of baryon feedback (Sect. 4.3).

4.1 Coincidences 

What is generally considered as the biggest problem for the ΛCDM model is that it requires a large
and still unexplained fine-tuning to reduce by 120 orders of magnitude the theoretical expectation of
the vacuum energy to yield the observed cosmological constant value, and, even more importantly,
that it faces a coincidence problem to explain why the dark energy density $\Omega_\Lambda$ is precisely of the
same order of magnitude as the other cosmological components today. This uncanny coincidence
is generally seen as evidence for some yet-to-discover underlying cosmological mechanism ruling 
the evolution of dark energy (such as quintessence or generalized additional fluid components, see, 
e.g., [107] ). But it could also indicate that the effect attributed to dark energy is rather due to a 
breakdown of General Relativity (GR) on the largest scales [159] . 

Then, as we shall see in more detail in Sect. 4.3, another coincidence, which is central to 
this whole review, is the appearance of a characteristic scale - dubbed $a_0$ - in the behavior of
the dark matter sector, a scale with units of acceleration. This acceleration scale appears in
various seemingly unrelated galactic scaling relations, mostly unpredicted by the ΛCDM model

^The simplest WIMPs are their own antiparticle. 

^In addition, the time-averaged value of the deceleration parameter q over the present age of the Universe is 
quite consistently (g) = 14741 , another currently unexplained coincidence. 
(see Sect. 4.3). The value of this scale is $a_0 \simeq 10^{-10}$ m s$^{-2}$, which yields in natural units $a_0 \sim H_0$
(or, more precisely, $a_0 \approx cH_0/2\pi$). It is perhaps even more meaningful [5^ 12991 1305J to note that,
in these same units:

$$a_0^2 \sim \Lambda, \qquad (1)$$

where $\Lambda$ is the currently favored value of the cosmological constant. Whether these numerical
coincidences are physically relevant or just true (insignificant) coincidences remains an open ques- 
tion, closely related to the nature of the dark sector, which we are going to elaborate on in the next 
sections. But, at this stage, it is in any case striking that the dark matter and dark energy sectors 
do have such a common scale. This coincidence of scales, together with the coincidence of energy 
densities at redshift zero, might perhaps be a strong indication that one should cease to consider 
dark energy as an additional component physically independent from the dark matter sector [7],
and/or cease to consider that GR correctly describes gravity on the largest scales and in extremely 
weak gravitational fields, in order to perhaps address the two above coincidence problems at the 
same time. 
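
As a rough numerical check of the coincidences quoted above, the following minimal sketch evaluates $cH_0/2\pi$ and $(c^2/2\pi)\sqrt{\Lambda/3}$ in SI units. The values $H_0 = 70$ km/s/Mpc and $\Omega_\Lambda = 0.73$ are assumptions made for illustration only (they are not quoted in this section), as is the standard relation $\Lambda = 3\Omega_\Lambda H_0^2/c^2$ used to obtain $\Lambda$ in inverse length-squared. The function names are hypothetical.

```python
import math

C = 2.998e8             # speed of light, m/s
MPC_IN_M = 3.0857e22    # metres per megaparsec

# Assumed cosmological parameters (illustrative, not taken from the text).
H0 = 70.0 * 1.0e3 / MPC_IN_M    # Hubble constant converted to s^-1
OMEGA_LAMBDA = 0.73             # dark energy density parameter


def a0_from_hubble(h0):
    """Return c*H0/(2*pi) in m/s^2."""
    return C * h0 / (2.0 * math.pi)


def a0_from_lambda(h0, omega_lambda):
    """Return (c^2/(2*pi)) * sqrt(Lambda/3) in m/s^2, with Lambda in m^-2."""
    lam = 3.0 * omega_lambda * h0**2 / C**2   # cosmological constant, m^-2
    return C**2 / (2.0 * math.pi) * math.sqrt(lam / 3.0)


# Both estimates should come out at the order of 1e-10 m/s^2.
print(f"c H0 / 2 pi          = {a0_from_hubble(H0):.2e} m/s^2")
print(f"(c^2/2pi) sqrt(L/3)  = {a0_from_lambda(H0, OMEGA_LAMBDA):.2e} m/s^2")
```

Under these assumed inputs both expressions land within a factor of order unity of $10^{-10}$ m s$^{-2}$, which is all the coincidence argument requires.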

Finally, let us note that the existence of the $a_0$-scale is actually not the only dark matter-related
coincidence, as there is also in principle absolutely no reason why the mechanism leading to the
baryon asymmetry (between baryonic matter and antimatter) would simultaneously leave both the
baryon and dark matter densities with a similar order of magnitude ($\Omega_{DM}/\Omega_b = 5$). If the effects
we attribute to dark matter are actually also due to a breakdown of GR on cosmological scales, 
then such a coincidence might perhaps appear more natural as the baryons would then be the 
actual source of the effect attributed to the dark matter sector. 

4.2 Unobserved predictions 

Apart from the above puzzling coincidences, the concordance ΛCDM model also has a few more
concrete empirical challenges to address, in the sense of having made a few predictions in contradiction
with observations (with the caveat in mind that the model itself is not always that predictive
on small scales). These include the following non-exhaustive list: 

1. The bulk flow challenge. Peculiar velocities of galaxy clusters are predicted to be of the 
order of 200 km/s in the ΛCDM model. These can actually be measured by studying the
fluctuations in the CMB generated by the scattering of the CMB photons by the hot
X-ray-emitting gas inside clusters (the kinematic SZ effect). This yields an observed coherent bulk
flow of order 1000 km/s (5 times more than predicted) on scales out to at least 400 Mpc [222].
This bulk flow challenge appears not only in SZ studies but also in galaxy studies [484].
A related problem is the collision velocity larger than 3100 km/s for the merging bullet
cluster 1E0657-56 at z = 0.3, much too high to be accounted for by ΛCDM [553 HMj.
These observations would seem to indicate that the attractive force between DM particles
is enhanced compared to what ΛCDM predicts, and changing CDM into WDM would not
solve the problem. 

2. The high-z clusters challenge. Observation of even a single massive cluster at high redshift
can falsify ΛCDM [332]. In this respect the existence of the galaxy cluster XMMU J2235.3-2557
[369] with a mass of $\sim 4 \times 10^{14}\, M_\odot$ at z = 1.4, even though not sufficient to rule out

^Natural units: $c = G = \hbar = 1$.

^We have that $\Lambda \sim a_0^2/c^2$ if expressed in inverse time-squared, or $\Lambda \sim a_0^2/c^4$ if expressed in inverse length-squared
(more precisely, the natural scale associated with the cosmological constant is $a_0 \approx (c^2/2\pi)\sqrt{\Lambda/3}$). Another way
of expressing this coincidence is thus to say that predictions of GR from visible matter alone always break down for
physics involving a length-scale constant of the order of the Hubble radius $l \sim \Lambda^{-1/2} \sim c^2/a_0$. This scale $l$ could
perhaps play a similar role to the Planck scale $l_P$ [234, 44], at the other end of the ladder (as we have $l \sim 10^{60}\, l_P$).
This is however not the length at which the modification would be seen, exactly as quantum mechanics does not 
depart from classical physics at a given length. 
the model, is very surprising and could indicate that structure formation is actually taking
place earlier and faster than in ΛCDM (see also [421] on the Shapley supercluster and the
Sloan Great Wall). 

3. The Local Void challenge. The Local Volume is composed of 562 known galaxies at 
distances smaller than 8 Mpc from the center of the Local Group, and the region known
as the "Local Void" hosts only 3 of them. This is far fewer than the expected ~ 20 for
a typical similar void in ΛCDM [350]. What is more, in the Local Volume, large luminous
galaxies are over-represented by a factor of 6 in the underdense regions, exactly opposite to
what is expected from ΛCDM. This could mean that the Local Volume is just a statistical
anomaly, but it could also point, in line with the two previous challenges, towards more rapid 
structure formation, allowing sparse regions to more quickly form large galaxies cleaning their 
environment, making the galaxies larger and the voids emptier at early times [350]. 

4. The missing satellites challenge. It has long been known that the model predicts an
overabundance of dark subhalos orbiting Milky Way-sized galaxies compared to the observed 
number of satellite galaxies around the Milky Way [330] . This is a different problem from 
the above predicted overabundance of small galaxies in voids. It has subsequently been 
suggested that stellar feedback and heating processes limit baryonic growth, that re-ionisation 
prevents low-mass dark halos from forming stars, and that tidal forces from the host halo 
limit growth of the dark matter sub-halos and lead to their truncation. This important 
theoretical effort has led recent semi-analytic models to predict a reduced number of ~ 100 
to 600 faint satellites rather than the original thousands. Moreover, during the past 15 years
13 "new" and mostly ultra-faint satellite galaxies have been found in addition to the 11
previously known classical bright ones. Since these new galaxies have been largely discovered 
with the Sloan Digital Sky Survey (SDSS), and since this survey covered only one fifth of 
the sky, it has been argued that the problem was solved. However, there are actually still 
missing satellites on the low mass and high mass end of the mass function predicted by
"ΛCDM+re-ionisation" semi-analytic models. This is best illustrated in Figure 2 of [240]
showing the cumulative distribution for the predicted and observationally derived masses 
within the central 300 pc of Milky Way satellites. A lot of low-mass satellites are still missing, 
and the most massive predicted subhaloes are also incompatible with hosting any of the known 
Milky Way satellites [TU [75] [73]. This is thus the modern version of the missing satellites
challenge. An obvious but rather discomforting way-out would be to simply state that the 
Milky Way must be a statistical outlier, but this is contradicted by the study of [448] on 
the abundance of bright satellites around Milky Way-like galaxies in SDSS. Another solution 
would be to change from CDM to WDM [253] (it is actually one of the only listed challenges 
that such a change would probably immediately solve). 

5. The satellites phase-space correlation challenge. In addition to the above challenge, 
the distribution of dark subhalos around the Galaxy is also predicted by ΛCDM to be
isotropic, or quasi-isotropic. However, the Milky Way satellites are currently observed to be
correlated in phase-space: they lie within a seemingly rotation-supported disk [240]. Young
halo globular clusters define the same disk, and streams of stars and gas, tracing the orbits of 
the objects from which they are stripped, preferentially lie in this disk, too [348] . Since SDSS 
covered only one fifth of the sky, it will be interesting to see whether future surveys such 
as Pan-Starrs will confirm this state of affairs. Whether or not this phase-space correlation 
would be unique to the Milky Way should also be carefully checked, the evidence in M31 be- 
ing currently much less convincing, with a richer and more complex satellite population [290] . 
But in any case, the current distribution of satellites around the Milky Way is statistically 
incompatible with the predictions of ΛCDM at a very high level of confidence even when
taking into account the observational bias from SDSS [240]. While this might perhaps have
been explained by the infall of a small group of galaxies that would have retained correlated
orbits, this solution is ruled out by the fact that no nearby groups are observed to be anywhere
near as spatially small as the disk of satellites [291]. Another solution might be that
most Milky Way satellites are actually not primordial galaxies but old tidal dwarf galaxies 
created in an early major merger event, accounting for their presently correlated phase-space 
distribution [347] . Note in passing that if only one or two long-lived tidal dwarfs are cre- 
ated in each gas-dissipational galaxy encounter, they could probably account for most of the 
dwarf galaxy population in the Universe, leaving no room for small CDM subhalos to cre- 
ate galaxies, which would transform the missing satellites challenge into a missing satellites 
catastrophe [240] . 

6. The cusp-core challenge. Another long-standing problem of ΛCDM is the fact that the
simulations of the collapse of CDM halos lead to a density distribution as a function of
radius, $\rho(r)$, which is well fitted by a smooth function asymptoting to a central cusp with
slope $d\ln\rho/d\ln r = -1$ in the central parts [12711333], while observations clearly point towards
large constant density cores in the central parts [1191 I17Q[ 1480]. Even though the latest
simulations [334] rather point towards Einasto [134] profiles with $d\ln\rho/d\ln r \propto -r^{1/n}$ (with
$n$ slightly varying with halo mass, and $n \sim 6$ for a Milky Way-sized halo, meaning that the
slope is zero only very close to the nucleus [178], and is still $\sim -1$ at 200 pc from the center),
fitting such profiles to observed galactic kinematical data such as rotation curves [53] leads
to values of $n$ that are much smaller than simulated values (meaning that they have much
larger cores), which is another way of re-assessing the old cusp problem of ΛCDM. Note that
a change from CDM to WDM could solve the problem in dwarf galaxies, by leading to the 
formation of small cores, but certainly not in large galaxies where large cores are needed from 
observations. One thus has to rely on baryon feedback to erase the cusp from all galaxies. 
But this is not easily done, as the adiabatic cooling of baryons in the centre of dark matter 
halos should lead to an even more concentrated dark matter distribution. A possibility would 
be that angular momentum transfer from a rotating stellar bar destroys dark-matter cusps: 
however, significant cusp destruction requires substantially more angular momentum than 
is realistically available in stellar bars [5ni 1287] . Note also that not all galaxies are barred 
(e.g., M33 is not). The state-of-the-art solution nowadays is to enforce strong supernovae 
outflows that move large amounts of low-angular-momentum gas from the central parts and
that "pull" on the central dark matter concentration to create a core [177], but this is still
a highly fine-tuned process which fails to address the baryon fraction problem (see challenge 
10 below). 

7. The angular momentum challenge. As a consequence of the merger history of galaxy 
disks in a hierarchical formation scenario, as well as of the associated transfer of angular 
momentum from the baryonic disk to the dark halo, the specific angular momentum of 
the baryons ends up being much too small in simulated disks, which in turn end up much 
smaller than the observed ones [3]. Similarly, elliptical systems end up too concentrated as well.
Addressing this challenge within the standard paradigm essentially relies on forming disks
through late-time quiescent gas accretion from large-scale filaments, with far fewer late-time
mergers than presently predicted in ΛCDM.

8. The pure disk challenge. Related to the previous challenge, large bulgeless thin disk 
galaxies are extremely difficult to produce in simulations. This is because major mergers, at 
any time in the galaxy formation process, typically create bulges, so bulgeless galaxies would 
represent the quiescent tail of a distribution of merger histories for galaxies of the Local 
Volume. However, these bulgeless disk galaxies represent more than half of large galaxies 
(with $V_c > 150$ km/s) in the Local Volume [179, 232]. Solving this problem would rely, e.g.,
on suppressing central spheroid formation for mergers with mass ratios lower than 30% [229].

9. The stability challenge. Round CDM halos tend to stabilize very low surface density 
disks against the formation of bars and spirals, due to a lack of disk self-gravity [292]. The
observation [282] of Low Surface Brightness (LSB) disk galaxies with strong bars and spirals
is thus challenging in the absence of a significant disk component of dark matter. What is
more, in the absence of such a disk DM component, the lack of disk self-gravity prevents the
creation of very large razor thin LSB disks, but these are observed [223, 261]. In the standard
context, these observations would tend to point towards an additional disk DM component, 
either a CDM-one linked to in-plane accretion of satellites or a baryonic one in the form of 
molecular gas. 

10. The missing baryons challenge(s). As mentioned above, constraints from the CMB imply
$\Omega_m = 0.27$ and $\Omega_b = 0.046$. However, our inventory of known baryons in the local Universe,
summing over all observed stars, gas, etc., comes up short of the total. For example, [43]
estimate that the sum of stars and cold gas is only ~ 5% of $\Omega_b$. While there now seems to be
a good chance that many of the missing baryons are in the form of highly ionized gas in the 
warm-hot intergalactic medium (WHIM), we are still far from being able to give a confident 
account of where all the baryons reside. Indeed, there could be multiple distinct reservoirs in 
addition to the WHIM, each comparable to the mass in stars, within the current uncertainties. 
But there is another missing baryons challenge, namely the halo-by-halo missing baryons. 
Indeed, each CDM halo can, to a first approximation, be thought of as a microcosm of the 
Whole. As such, one would naively expect each halo to have the same baryon fraction as the
whole Universe, $f_b = \Omega_b/\Omega_m = 0.17$. On the scale of clusters of galaxies, this is approximately
true (but still systematically low), but for individual galaxies, observations depart from this
in a systematic way which we have yet to understand, and which has nothing to do with the
truncation radius. The ratio of the galaxy-detected baryon fraction over the cosmological
one, $f_d$, is plotted as a function of the potential well of the systems in Figure 2 [284]. There
is a clear correlation, less massive objects being much more dark matter dominated than
massive ones. This correlation is a priori not predicted at all by ΛCDM, at least not with
the correct shape [274]. This missing baryons challenge is actually closely related to the
baryonic Tully-Fisher relation, which we expand on in the following Sect. 4.3.1.
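
The bookkeeping behind the halo-by-halo missing baryons in challenge 10 can be sketched in a few lines. The snippet below computes the cosmic baryon fraction from the density parameters quoted in the text, and the detected baryon fraction as the ratio of detected baryonic mass to the cosmic fraction of the dynamical mass. The function names and the example masses are hypothetical, chosen purely for illustration and not drawn from any data set.

```python
OMEGA_M = 0.27    # total matter density parameter (from the text)
OMEGA_B = 0.046   # baryon density parameter (from the text)


def cosmic_baryon_fraction(omega_b=OMEGA_B, omega_m=OMEGA_M):
    """Return f_b = Omega_b / Omega_m."""
    return omega_b / omega_m


def detected_baryon_fraction(m_baryon, m_dynamical, f_b=None):
    """Return f_d = M_b / (f_b * M_dyn); both masses in the same units."""
    if f_b is None:
        f_b = cosmic_baryon_fraction()
    return m_baryon / (f_b * m_dynamical)


f_b = cosmic_baryon_fraction()
print(f"f_b = {f_b:.3f}")

# Hypothetical galaxy: 1e10 solar masses of detected baryons inside a
# dynamical mass of 5e11 solar masses (illustrative numbers only).
print(f"f_d = {detected_baryon_fraction(1.0e10, 5.0e11):.3f}")
```

A value of f_d well below unity means that most of the baryons expected from the cosmic fraction are not detected in that system.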

Let us however note that, while challenges 1 to 3 are not real smoking guns yet for the ΛCDM
model, challenges 4 to 10 are concerned with processes happening on kpc scales, for which it is
fair to consider that the model is not very predictive because the baryon physics should play a
more important role, and this is hard to take into account rigorously. However, it is not sufficient
to qualitatively invoke handwavy baryon physics to avoid confronting predictions of ΛCDM with
observations. It is also mandatory to show that the feedback from the baryons which is needed to
solve the observational problems is what would quantitatively happen in a physical galaxy. This,
presently, is not the case yet for the aforementioned challenges. However, these challenges are
"model-dependent problems", in the sense of being failed predictions of a given model, but would
not have appeared a priori surprising without the standard concordance model at hand. This
means that subtly changing some parameters of the model (like, e.g., swapping CDM for WDM,
making DM more self-interacting, etc.) might help solve at least a few of them. But what is
even more challenging is a set of observations that appear surprising independently of any specific 
dark matter model, as they involve a fine-tuned relation between the distribution of visible and 
dark matter. These are what we call hereafter "unpredicted observations".
Figure 2: The fraction of the expected baryons that are detected as a function of potential well
depth (bottom axis) and mass (top). Measurements are referenced to the radius $R_{500}$ where
the enclosed density is 500 times the cosmic mean [284]. The detected baryon fraction is
$f_d = M_b/(0.17\, M_{500})$, where $M_b$ is the detected baryonic mass, 0.17 is the universal baryon fraction
[230], and $M_{500}$ is the dynamical mass (baryonic + dark mass) enclosed by $R_{500}$. Each point is
a bin representing many objects. Gray triangles represent galaxy clusters, which come close to
containing the cosmic fraction. The detected baryon fraction declines systematically for smaller
systems. Dark blue circles represent star dominated spiral galaxies. Light blue circles represent
gas dominated disk galaxies. Orange squares represent Local Group dwarf satellites for which the
baryon content can be less than 1% of the cosmic value. Where these missing baryons reside is
one of the challenges currently faced by ΛCDM.
4.3 Unpredicted observations

There are several important examples of systematic relations between the dynamics of galaxies (in 
theory presumed to be dominated by dark matter) and their baryonic content. These relations 
are fully empirical, and as such must be explained by any viable theory. As we shall see, they 
inevitably involve a critical acceleration scale, or equivalently, a critical surface density of baryonic 
matter. 

4.3.1 Baryonic Tully-Fisher relation

One of the strongest correlations in extragalactic astronomy is the Tully-Fisher relation [468].
Originally identified as an empirical relation between a galaxy's luminosity and its HI line-width, 
it has been extensively employed as a distance indicator. Though extensively studied for decades, 
the physical basis of the relation remains unclear. 

Luminosity and line-width are readily accessible observational quantities. The optical lumi- 
nosity of a galaxy is a proxy for its stellar mass, and the HI line-width is a proxy for its rotation 
velocity. The quality of the correlation improves as more accurate indicators of these quantities 
are employed. For example, resolved rotation curves where the flat portion of the rotation curve 
Vf or the maximum peak velocity Vp can be measured give relations that are tighter than those 
utilizing only line-width information [109]. Similarly, the scatter declines as we shift from optical
luminosities to those in the near-infrared [476] as the latter are expected to give a more reliable
mapping of starlight to stellar mass [43].

It was then realized [323, 158, 283] that a more fundamental relation was that between the
total observed baryonic mass and the rotation velocity. In most bright galaxies, the stars harbor 
the majority of the detected baryonic mass, so luminosity suffices as a proxy for mass. The next 
most important known reservoir of baryons is the neutral atomic hydrogen (HI) of the interstellar 
medium. As studies have probed down the mass spectrum to lower mass, more slowly rotating 
systems, a higher preponderance of gas rich galaxies is found. The luminous Tully-Fisher relation
breaks down [283, 273], but a tight relation persists if instead of luminosity, the detected baryonic
mass $M_b = M_* + M_g$ is used ME SSI 12121 13M1 131 SH EMI [278,. This is the Baryonic
Tully-Fisher Relation (BTFR), plotted in Figure 3.

The luminous Tully-Fisher relation extends over about two decades in luminosity. Recent work
extending the relation to low mass, typically low surface brightness and gas rich galaxies [32l 14461
I463J extends the dynamic range of the BTFR to five decades in baryonic mass. Over this range, the
BTFR has remarkably little intrinsic scatter (consistent with zero given the observational errors) 
and is well described as a power law, or equivalently, as a straight line in log-log space: 



$$\log M_b = \alpha \log V_f - \log \beta, \qquad (2)$$

with slope $\alpha = 4$ [273, 446, 278]. This slope is consistent with a constant acceleration scale
$a = V_f^4/(G M_b)$.

The acceleration scale $a \approx 10^{-10}$ m s$^{-2}$ $\sim \Lambda^{1/2}$ (Eq. 1) is thus present in the data. Figure 4
shows the distribution of this acceleration $V_f^4/(G M_b)$ around the best fit line in Figure 3, strongly
peaked around $\sim 2 \times 10^{-62}$ in natural units. As we shall see, this acceleration scale arises empirically
in a variety of distinct situations involving the mass discrepancy problem.

^As we shall see (Sect. 5 and 6), MOND was constructed to predict a relation $a_0 = V_f^4/(GM)$ for a point mass
$M$ (note that the slope of 4 is however a pure consequence of the acceleration base, it is not possible to get an
arbitrary slope from such an idea). Since spiral galaxies are not point masses but rather flattened mass distributions
that rotate faster than the equivalent spherical mass distribution [53], the empirical acceleration $a$ is close to but
not identical to $a_0$ in MOND. The geometric correction is about 20% so that $a_0 = 0.8a$ [273].
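
To make the slope-4 relation concrete, the following minimal sketch evaluates the baryonic mass implied by a given flat rotation velocity, assuming the relation takes the form $M_b = V_f^4/(G a)$ with the constant acceleration parameter $a = 1.2 \times 10^{-10}$ m s$^{-2}$ quoted in the caption of Figure 3. The function names are hypothetical, and the velocities in the loop are illustrative inputs rather than measurements.

```python
G = 6.674e-11       # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30    # solar mass, kg
A_SCALE = 1.2e-10   # acceleration parameter from the Figure 3 caption, m s^-2


def btfr_baryonic_mass(v_f_kms, a=A_SCALE):
    """Baryonic mass in solar masses for a flat rotation velocity in km/s."""
    v = v_f_kms * 1.0e3                 # convert to m/s
    return v**4 / (G * a) / M_SUN


def btfr_acceleration(v_f_kms, m_b_msun):
    """Acceleration a = V_f^4 / (G M_b) in m/s^2 for one galaxy."""
    v = v_f_kms * 1.0e3                 # convert to m/s
    return v**4 / (G * m_b_msun * M_SUN)


# Doubling the velocity raises the mass by 2^4 = 16 on a slope-4 relation.
for v_f in (50.0, 100.0, 200.0):
    print(f"V_f = {v_f:5.0f} km/s  ->  M_b = {btfr_baryonic_mass(v_f):.2e} M_sun")
```

Applying `btfr_acceleration` to observed pairs of velocity and baryonic mass is the operation that yields a distribution of accelerations such as the histogram described in Figure 4.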
Figure 3: The Baryonic Tully-Fisher (mass-rotation velocity) relation for galaxies with well measured
outer velocities $V_f$. The baryonic mass is the combination of observed stars and gas:
$M_b = M_* + M_g$. Galaxies have been selected that have well observed, extended rotation curves from
21 cm interferometric observations providing a good measure of the outer, flat rotation velocity.
The dark blue points are galaxies with $M_* > M_g$ [273]. The light blue points have $M_* < M_g$ [278]
and are generally less precise in velocity, but more accurate in the sense that the result is less
affected by possible systematics in the stellar mass-to-light ratio. For a detailed discussion of the
stellar mass-to-light ratios used here, see [273, 278]. The dotted line has slope 4 corresponding to
a constant acceleration parameter, $1.2 \times 10^{-10}$ m s$^{-2}$. The dashed line has slope 3 as expected in
ΛCDM with the normalization expected if all of the baryons associated with dark matter halos
are detected. The difference between these two lines is the origin of the variation in the detected
baryon fraction in Figure 2.
Figure 4: Histogram of the accelerations $a = V_f^4/(G M_b)$ in m s$^{-2}$ (bottom axis) and natural units
[$c^4/(G m_P)$ where $m_P$ is the Planck mass] for galaxies with well measured $V_f$. The data are peaked
around a characteristic value of $\sim 10^{-10}$ m s$^{-2}$ ($\sim 2 \times 10^{-62}$ in natural units).



A BTFR of the observed form does not arise naturally in ΛCDM. The naive expectation is
$\alpha = 3$ and $\beta = 10 f_V^3 G H_0$ [447], where $H_0$ is the Hubble constant and $f_V$ is a factor of order
unity (currently estimated to be ~ 1.3 [362]) that relates the observed $V_f$ to the circular velocity
of the potential at the virial radius. This modest fudge factor is necessary because ΛCDM does
not explicitly predict either axis of the observed BTFR. Rather, there is a relationship between
total (baryonic plus dark) mass and rotation velocity at very large radii. This simple scaling fails
(dashed line in Figure 3), obliging us to introduce an additional fudge factor $f_d$ [274, 284] that
relates the detected baryonic mass to the total mass of baryons available in a halo. This mismatch
drives the variation in the detected baryon fraction $f_d$ seen in Figure 2. A constant $f_d$ is excluded
by the difference between the observed and predicted slopes; $f_d$ must vary with $V_f$, or $M$, or the
gravitational potential $\Phi$.

This brings us to the first fine-tuning problem posed by the data. There is essentially zero 
intrinsic scatter in the BTFR [278], while the detected baryon fraction $f_d$ could in principle take
any value between zero and unity. Somehow galaxies must "know" what the circular velocity of 
the halo they reside in is so that they can make observable the correct fraction of baryons. 

^The factor 10 arises from the commonly adopted definition of the virial radius of the dark matter halo at an
overdensity of 200 times the critical density of the Universe [333].

^Note that [181] claimed to measure a slope of 3 for the BTFR, but they relied on unresolved line-widths
from single dish 21 cm observations to estimate rotation velocity rather than measuring $V_f$ from resolved rotation
curves. Line-widths give a systematically different estimate of the slope of the BTFR than $V_f$, even for the same
galaxies [277, 341, 476], and they cannot be related at all to the circular velocity of the potential at the virial radius,
nor to the prediction of MOND (Sect. 5 and 6).

Quantitatively, in the ΛCDM picture, the baryonic mass plotted in the BTFR (Figure 3) is
$M_b = M_* + M_g$ while the total baryonic mass available in a halo is $f_b M_{tot}$. The difference
between these quantities implies a reservoir of dark baryons in some undetected form, $M_{other}$. It
is commonly speculated that the undetected baryons could be in a hard-to-detect hot, diffuse,
ionized phase mixed in with the dark matter halo (and extending to comparable radius), or that
the missing baryons have been entirely blown away by winds from supernovae. For the purposes
of this argument, it does not matter which form the dark baryons take. All that matters is that a
substantial mass of them are required so that [283]



$$f_d = \frac{M_b}{f_b M_{tot}} = \frac{M_* + M_g}{M_* + M_g + M_{other}}. \qquad (3)$$

Since there is negligible intrinsic scatter in the observed BTFR, there must be effectively zero
scatter in $f_d$. By inspection of Eq. 3, it is apparent that small scatter in $f_d$ can only be obtained
naturally in the limits $M_* + M_g \gg M_{other}$ so that $f_d \to 1$, or $M_* + M_g \ll M_{other}$ so that $f_d \to 0$.
Neither of these limits applies. We require not only an appreciable mass in dark baryons $M_{other}$,
but we need the fractional mass of these missing baryons to vary in lockstep with the observed
rotation velocity $V_f$. Put another way, for any given galaxy, we know not only how many baryons
we see, but also how many we do not see - a remarkable feat of non-observation.
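
The lockstep variation described above can be illustrated by dividing the observed slope-4 relation by the naive slope-3 expectation. The sketch below assumes the observed relation $M_b = V_f^4/(G a)$ with $a = 1.2 \times 10^{-10}$ m s$^{-2}$, and an available baryonic mass $f_b M_{tot}$ with $M_{tot} = V_f^3/(10 f_V^3 G H_0)$, $f_V = 1.3$ and $f_b = 0.17$ as quoted in the text. The value $H_0 = 70$ km/s/Mpc is an assumption, as are the function names. Since this $M_{tot}$ is defined at the virial radius rather than at $R_{500}$, the numbers are indicative only and are not meant to reproduce Figure 2.

```python
G = 6.674e-11           # gravitational constant, m^3 kg^-1 s^-2
MPC_IN_M = 3.0857e22    # metres per megaparsec
H0 = 70.0 * 1.0e3 / MPC_IN_M   # assumed Hubble constant, s^-1
F_B = 0.17              # cosmic baryon fraction (from the text)
F_V = 1.3               # ratio of V_f to virial circular velocity (from the text)
A_SCALE = 1.2e-10       # BTFR acceleration parameter, m s^-2


def observed_baryonic_mass(v_f):
    """Observed BTFR mass in kg for V_f in m/s (slope 4)."""
    return v_f**4 / (G * A_SCALE)


def available_baryonic_mass(v_f):
    """Naive LCDM baryon budget f_b * M_tot in kg for V_f in m/s (slope 3)."""
    return F_B * v_f**3 / (10.0 * F_V**3 * G * H0)


def detected_fraction(v_f_kms):
    """Detected baryon fraction f_d implied by the two scalings."""
    v_f = v_f_kms * 1.0e3
    return observed_baryonic_mass(v_f) / available_baryonic_mass(v_f)


# f_d scales linearly with V_f because the slopes differ by exactly one.
for v_f_kms in (30.0, 100.0, 300.0):
    print(f"V_f = {v_f_kms:5.0f} km/s  ->  f_d = {detected_fraction(v_f_kms):.2f}")
```

Under these assumptions the detected fraction is a few tenths for a galaxy rotating at 100 km/s and falls in proportion to $V_f$ for slower rotators, which is the fine-tuning the text describes: $f_d$ cannot be constant, and it must track the rotation velocity with essentially no scatter.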




Figure 5: Residuals ($\delta \log V_f$) from the baryonic Tully-Fisher relation as a function of a galaxy's
characteristic baryonic surface density ($\Sigma_b = 0.75 M_b/R_p^2$ [272], $R_p$ being the radius at which the
contribution of baryons to the rotation curve peaks). Color differentiates between star (dark blue)
and gas (light blue) dominated galaxies as in Figure 3, but not all galaxies there have sufficient
data (especially of $R_p$) to plot here. Stellar masses have been estimated with stellar population
synthesis models [l^. More accurate data, with uncertainty on rotation velocity less than 5%,
are shown as larger points; less accurate data are shown as smaller points. The rotation velocity
of galaxies shows no dependence on the distribution of baryons as measured by $\Sigma_b$ or $R_p$. This
is puzzling in the conventional context, where $V^2 = GM/r$ should lead to a strong systematic
residual [110].

Another remarkable fact about the BTFR is that it shows no residuals with variations in
the distribution of baryons [518, 444, 110, 272]. Figure 5 shows deviations from the BTFR as
a function of the characteristic baryonic surface density of the galaxies, as defined in [272], i.e.,
$\Sigma_b = 0.75 M_b/R_p^2$, where $R_p$ is the radius at which the rotation curve $V_b(r)$ of baryons peaks. Over
several decades in surface density, the BTFR is completely insensitive to variations in the mass
distribution of the baryons. This is odd because, a priori, $V^2 \sim M/R$, and thus $V^4 \sim M\Sigma$.
Yet the BTFR is $M_b \propto V_f^4$ with no dependence on $\Sigma$. This brings us to a second fine-tuning
problem. For some time, it was thought [157] that spiral galaxies all had very nearly the same
surface brightness (a condition formerly known as "Freeman's Law"). If this is indeed the case,
the observed BTFR naturally follows from the constancy of $\Sigma$. However, there do exist many
low surface brightness galaxies [265] that violate the constancy of surface brightness implied in
Freeman's Law. One would thus expect them to deviate systematically from the Tully-Fisher
relation, with lower surface brightness galaxies having lower rotation velocities at a given mass.
Yet they do not. Thus one must fine-tune the mass surface density of the dark matter to precisely
make up for that of the baryons [279]. As the surface density of baryons declines, that of the dark
matter must increase just so as to fill in the difference (Figure 6 [272]). The relevant quantity
is the dynamical surface density enclosed within the radius where the velocity is measured. The
latter matters little along the flat portion of the rotation curve, but the former is the sum of dark
and baryonic matter.
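
To make the size of the expected residual concrete: combining $V^2 \sim GM/R$ with $\Sigma \sim M/R^2$
gives $V^4 \sim G^2 M \Sigma$, so at fixed mass the Newtonian scaling implies
$\delta \log V = \frac{1}{4}\,\delta \log \Sigma$. The short Python sketch below evaluates this shift. The
function name and the reference surface density are illustrative assumptions, not quantities
defined in the text.

```python
import math


def newtonian_logv_residual(sigma, sigma_ref):
    """Expected shift in log10(V) at fixed mass if V^4 scales as M * Sigma.

    `sigma` and `sigma_ref` are surface densities in the same (arbitrary)
    units; `sigma_ref` is a hypothetical reference value.
    """
    return 0.25 * math.log10(sigma / sigma_ref)


# A galaxy two decades lower in surface density than the reference
# would be expected to rotate about 0.5 dex more slowly at fixed mass.
print(newtonian_logv_residual(1.0, 100.0))
```

A systematic trend of this size across the several decades of surface density sampled would be
hard to miss in a plot of residuals, which is why its absence is notable.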

One might be able to avoid fine-tuning if all galaxies are dark matter dominated [110]. In
the limit $\Sigma_{\rm DM} \gg \Sigma_b$, the dynamics are entirely dark matter dominated and the distribution
of the baryons is irrelevant. There is some systematic uncertainty in the mass-to-light ratios of
stellar populations [43], making such an approach a priori tenable. In effect, we return to the
interpretation of $\Sigma \sim$ constant originally made by [5] in the context of Freeman's Law, but now
we invoke a constant surface density of CDM rather than of baryons. But as we will see, such an
interpretation, i.e., that $\Sigma_b \ll \Sigma_{\rm DM}$ in all disk galaxies, is flatly contradicted by other observations
(e.g., Figure 9 and Figure 13).

The Tully-Fisher relation is remarkably persistent. Originally posited for bright spirals, it
applies to galaxies that one would naively expect to deviate from it. This includes low luminosity,
gas dominated irregular galaxies [446, 463, 278], low surface brightness galaxies of all
luminosities [518, 444], and even tidal dwarfs formed in the collision of larger galaxies [166]. Such tidal
dwarfs may be especially important in this context (see also Sect. 6.5.4). Galactic collisions should
be very effective at segregating dark and baryonic matter. The rotating gas disks of galaxies that 
provide the fodder for tidal tails and the tidal dwarfs that form within them initially have nearly 
circular, coplanar orbits. In contrast, the dark matter particles are on predominantly radial orbits 
in a quasi-spherical distribution. This difference in phase space leads to tidal tails that themselves 
contain very little dark matter ^3]. When tidal dwarfs form from tidal debris, they should be 
largely devoid of dark matter. Nevertheless, tidal dwarfs do appear to contain dark matter [73]
and obey the BTFR [166]. 

The difference in phase space between gas and dark matter also prevents the accretion of tidal gas onto any
dark matter sub-halos that may be present. It does not suffice for a tidal tail to intersect the location of a sub-halo
in coordinate space; they must also dock in velocity space. The gas is moving at the characteristic velocity of the
entire system (typically ~ 200 km s$^{-1}$), which by definition exceeds the escape speed of typical sub-halos (usually
< 100 km s$^{-1}$). The odds of capture are therefore effectively zero unless the tail and sub-halo happen to be on
very nearly the same orbit initially, which is itself very unlikely because of the initial difference in their phase space
distribution.

The critical acceleration scale of Eq. 1 also appears in non-rotating galaxies. Elliptical galaxies
are three-dimensional stellar systems supported more by random motions than organized rotation.
First of all, in such systems of measured velocity dispersion $\sigma$, the typical acceleration $\sigma^2/R$ is also
of the order of $a_0$ within a factor of a few, where R is the effective radius of the system [399].
Moreover, they obey an analogous relation to the Tully-Fisher one, known as the Faber-Jackson relation
(Figure 7). In bulk, the data for these star-dominated galaxies follow the relation $\sigma^4/(GM_*) \propto a_0$
(dotted line in Figure 7). This is not strictly analogous to the flat part of the rotation curves
of spiral galaxies, the dispersion typically being measured at smaller radii where the equivalent
circular velocity curve is often falling [368, 324], or in a temporary plateau before falling again
(see also Sect. 6.6.1). Indeed, unlike the case in spiral galaxies where the distribution of stars
is irrelevant, it clearly does matter in elliptical galaxies (the Faber-Jackson relation is just one
projection of the "fundamental plane" of elliptical galaxies [86]). This is comforting: at small radii
in dense stellar systems where the baryonic mass of stars is clearly important, the data behave as
Newton predicts.

Figure 6: The fractional contribution to the total velocity $V_p$ at the radius $R_p$ where the
contribution of the baryons peaks for both baryons ($V_b/V_p$, top) and dark matter ($V_{\rm DM}/V_p$, bottom).
Points as per Figure 5. As the baryonic surface density increases, the contribution of the baryons
to the total gravitating mass increases. The dark matter contribution declines in compensation,
maintaining a see-saw balance that manages to leave no residual in the BTFR (Figure 5). The
absolute amplitude of $V_b$ and $V_{\rm DM}$ depends on choice of stellar mass estimator, but the fine-tuning
between them must persist for any choice of $M_*/L$.

The acceleration scale $a_0$ is clearly imprinted on the data for local galaxies. This is an empirical
statement that might not hold at all times, perhaps evolving over cosmic time or evaporating
altogether. Substantial efforts have been made to investigate the Tully-Fisher relation to high
redshift. To date, there is no persuasive evidence of evolution in the zero point of the BTFR out
to z = 0.6 [357, 358] and perhaps even to z = 1 [486]. One must exercise caution in interpreting
such results given the difficulty inherent in peering many Gyr back in cosmic time. Nonetheless,
it appears that the scale $a_0$ remains present in the data and has not obviously changed over the
more recent half of the age of the Universe.

4.3.2 The role of surface density 

The Freeman limit [157] is the maximum central surface brightness in the distribution of galaxy
surface brightnesses. Originally thought to be a universal surface brightness, it has since become
clear that instead galaxies exist over a wide range in surface brightness [265]. In the absence
of a perverse and fine-tuned anti-correlation between surface brightness and stellar mass-to-light
ratio [518], this implies a comparable range in baryonic surface density (Figure 8).

An upper limit to the surface brightness distribution is interesting in the context of disk stability.
Recall that dynamically cold, purely Newtonian disks are subject to potentially self-destructive
instabilities, one cure being to embed them in the potential wells of spherical dark matter
halos [344]. While the proper criterion for stability is much debated [132, 416], it is clear that the
dark matter halo moderates the growth of instabilities and that the ratio of halo to disk self gravity
is a relevant quantity. The more self-gravitating a disk is, the more likely it is to suffer undamped
growth of instabilities. But in principle, galaxies with a baryonic disk and a dark matter halo
are totally scalable: if a galaxy model has a certain dynamics, and one multiplies all densities
by any (positive) constant (and also scales the velocities appropriately) one gets another galaxy
with exactly the same dynamics (with scaled time scales). So if one is stable, so is the other. In
turn, the mere fact that there might be an upper limit to $\Sigma$ is a priori surprising, and even more
so that there might be a coincidence of this upper limit with the acceleration scale $a_0$ identified
dynamically.

The scale $\Sigma_\dagger = a_0/G$ is clearly present in the data (Figure 8). Selection effects make high
surface brightness galaxies easy to detect and hence discover, but their intrinsic numbers appear
to decline exponentially when the central surface density of the stellar disk $\Sigma_0 > \Sigma_\dagger$ [265]. It
seems natural to associate the dynamical scale $a_0$ with the disk stability scale $\Sigma_\dagger$ since they are
numerically indistinguishable and both arise in the context of the mass discrepancy. However, there
is no reason to expect this in ACDM, which predicts denser dark matter halos than observed [280| 
[TTO] fT68] [242] [2411 EZSl EM- Such dense dark matter halos could stabilize much higher density 
disks than are observed to exist. Lacking a clear mechanism to specify this scale, it is introduced
into models by hand [116].
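
For orientation, the conversion from the acceleration scale to the surface density scale
$\Sigma_\dagger = a_0/G$ can be sketched in a few lines of Python. The numerical value adopted for $a_0$
below is an assumption made for this example (this section does not quote one), and the physical
constants are rounded.

```python
G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
A0 = 1.2e-10         # ASSUMED value of a_0 in m s^-2 (not given in this section)
M_SUN = 1.989e30     # solar mass, kg
PARSEC = 3.0857e16   # parsec, m


def critical_surface_density(a0=A0):
    """Return Sigma_dagger = a0 / G in kg m^-2 and in M_sun pc^-2."""
    sigma_si = a0 / G
    sigma_astro = sigma_si * PARSEC**2 / M_SUN
    return sigma_si, sigma_astro


sigma_si, sigma_astro = critical_surface_density()
print(f"{sigma_si:.2f} kg/m^2 = {sigma_astro:.0f} M_sun/pc^2")
```

With this assumed $a_0$ the scale comes out at several hundred solar masses per square parsec; a
different adopted $a_0$ rescales the result linearly.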

Poisson's equation provides a direct relation between the force per unit mass (centripetal
acceleration in the case of circular orbits in disk galaxies), the gradient of the potential, and the surface
density of gravitating mass. If there is no dark matter, the observed surface density of baryons
must correlate perfectly with the dynamical acceleration. If, on the other hand, dark matter
dominates the dynamics of a system, as we might infer from Figure 5 [279, 110], then there is no reason
to expect a correlation between acceleration and the dynamically insignificant baryons. Figure 9
shows the dynamical acceleration as a function of baryonic surface density in disk galaxies. The
acceleration $a_p = V_p^2/R_p$ is measured at the radius $R_p$ where the rotation curve $V_b(r)$ of baryons
peaks. Given the systematic variation of rotation curve shape [377, 496], the specific choice of radii
is unimportant. Nevertheless, this radius is advocated to be used by [110] since this maximizes the
possibility of perceiving the baryonic contribution in the plot of Figure 5. That this contribution
is not present leads to the inference that $\Sigma_b \ll \Sigma_{\rm DM}$ in all disk galaxies [110]. This is directly
contradicted by Figure 9, which shows a clear correlation between $a_p$ and $\Sigma_b$.

Figure 7: The Faber-Jackson relation for spheroidal galaxies, including both elliptical galaxies
(red squares, [86, 233]) and Local Group dwarf satellites [285] (orange squares are satellites of
the Milky Way; pink squares are satellites of M31). In analogy with the Tully-Fisher relation
for spiral galaxies, spheroidal galaxies follow a relation between stellar mass and line of sight
velocity dispersion ($\sigma$). The dotted line represents a constant value of the acceleration parameter
$\sigma^4/(GM_*)$. Note however that this relation is different from the BTFR because it applies to the
bulk velocity dispersion while the BTFR applies to the asymptotic circular velocity. In the context
of Milgrom's law (Sect. 5 hereafter) the Faber-Jackson relation is predicted only when relying on
assumptions such as isothermality, isotropy, and the slope of the baryonic density distribution (see
3rd law of motion in Sect. 5.2). In addition, not all pressure-supported systems are in the
weak-acceleration regime. So, in the context of Milgrom's law, deviations from the weak-field regime,
from isothermality and from isotropy, as well as variations in the baryonic density distribution
slope, would thus explain the scatter in this relation.

Figure 8: Size and surface density. The characteristic surface density of baryons as defined in
Figure 5 is plotted against their dynamical scale length $R_p$ in the left panel. The dark blue points
are star-dominated galaxies and the light-blue ones gas-dominated. High characteristic surface
densities at low $R_p$ in the left panel are typical of bulge-dominated galaxies. The stellar disk
component of most spiral galaxies is well approximated by the exponential disk with
$\Sigma(R) = \Sigma_0 e^{-R/R_d}$. This disk-only central surface density and the exponential scale length of the stellar
disk are plotted in the right panel. Galaxies exist over a wide range in both size and surface density.
There is a maximum surface density threshold (sometimes referred to as Freeman's limit) above
which disks become very rare [265]. This is presumably a stability effect, as purely Newtonian
disks are unstable [344, 416]. Stable disks only appear below a critical surface density
$\Sigma_\dagger \approx a_0/G$ [80, 178].

Figure 9: The dynamical acceleration $a_p = V_p^2/R_p$ in units of $a_0$ plotted against the characteristic
baryonic surface density [276]. Points as per Figure 5. The dotted line shows the relation
$a_p = G\Sigma_b$ that would be obtained if the visible baryons sufficed to explain the observed velocities in
Newtonian dynamics. Though the data do not follow this line, they do show a correlation
($a_p \propto \Sigma_b^{1/2}$). This clearly indicates a dynamical role for the baryons, in contradiction to the simplest
interpretation [110] of Figure 5 that dark matter completely dominates the dynamics.

The higher the surface density of baryons is, the higher the observed acceleration. The slope
of the relation is not unity, $a_p \propto \Sigma_b$, as we would expect in the absence of a mass discrepancy,
but rather $a_p \propto \Sigma_b^{1/2}$. To simultaneously explain Figure 5 and Figure 9 there must be a strong
fine-tuning between dark and baryonic surface densities (i.e., Figure 6), a sort of repulsion between
them, a repulsion which is however contradicted by the correlations between baryonic and dark
matter bumps and wiggles in rotation curves (see Sect. 4.3.4).

4.3.3 Mass discrepancy-acceleration relation 

So far we have discussed total quantities. For the BTFR, we use the total observed mass of
a galaxy and its characteristic rotation velocity. Similarly, the dynamical acceleration-baryonic
surface density relation uses a single characteristic value for each galaxy. These are not the only
ways in which the "magical" acceleration constant $a_0$ appears in the data. In general, the mass
discrepancy only appears at very low accelerations $a < a_0$ and not (much) above $a_0$. Equivalently,
the need for dark matter only becomes clear at very low baryonic surface densities $\Sigma < \Sigma_\dagger = a_0/G$.
Indeed, the amplitude of the mass discrepancy in galaxies anti-correlates with acceleration [271].

Figure 10: The mass discrepancy in spiral galaxies. The mass discrepancy is defined [271] as the
ratio $V^2/V_b^2$ where $V$ is the observed velocity and $V_b$ is the velocity attributable to visible baryonic
matter. The ratio of squared velocities is equivalent to the ratio of total to baryonic enclosed mass
for spherical systems. No dark matter is required when $V \approx V_b$, only when $V > V_b$. Many
hundreds of individual resolved measurements along the rotation curves of nearly one hundred
spiral galaxies are plotted. The top panel plots the mass discrepancy as a function of radius. No
particular linear scale is favored. Some galaxies exhibit mass discrepancies at small radii while
others do not appear to need dark matter until quite large radii. The middle panel plots the
mass discrepancy as a function of centripetal acceleration $a = V^2/r$, while the bottom panel plots
it against the acceleration $g_N = V_b^2/r$ predicted by Newton from the observed baryonic surface
density $\Sigma_b$. Note that the correlation appears a little better with $g_N$ because the data are stretched
out over a wider range in $g_N$ than in $a$. Note also that systematics on the stellar mass-to-light
ratios can make this relation slightly more blurred than shown here, but the relation is nevertheless
always present irrespective of the assumptions on stellar mass-to-light ratios [271]. There is thus a
clear organization: the amplitude of the mass discrepancy increases systematically with decreasing
acceleration and baryonic surface density.

In [271], the role of various possible scales, as well as the effects of different stellar
mass-to-light ratio estimators, on the mass discrepancy problem was examined. The amplitude of the
mass discrepancy, as measured by $(V/V_b)^2$, the squared ratio of observed velocity to that predicted by
the observed baryons, depends on the choice of estimator for stellar $M_*/L$. However, for any plausible
(non-zero) $M_*/L$, the amplitude of the mass discrepancy correlates with acceleration (Figure 10)
and baryonic surface density, as originally noted in [402, 267, 407]. It does not correlate with
radius and only weakly with orbital frequency.

There is no reason in the dark matter picture why the mass discrepancy should correlate with 
any physical scale. Some systems might happen to contain lots of dark matter; others very little. 
In order to make a prediction with a dark matter model, it is necessary to model the formation of 
the dark matter halo, the condensation of gas within it, the formation of stars therefrom, and any 
feedback processes whereby the formation of some stars either enables or suppresses the formation 
of further stars. This complicated sequence of events is challenging to model. Baryonic
"gastrophysics" is particularly difficult, and has thus far precluded the emergence of a clear prediction for
galaxy dynamics from ΛCDM.

ΛCDM does make a prediction for the distribution of mass in baryonless dark matter halos:
the NFW halo [333, 334]. These are remarkable for being scale free. Small halos have a profile
similar to large halos. No feature stands out that marks a unique physical scale as observed.
Galaxies do not resemble pure NFW halos [417], even when dark matter dominates the dynamics
as in low surface brightness galaxies [242, 244, 119]. The inference in ΛCDM is that gastrophysics,
especially the energetic feedback from stellar winds and supernova explosions, plays a critical role
in sculpting observed galaxies. This role is not restricted to the minority baryonic constituents; it
must also affect the majority dark matter [177]. Simulations incorporating these effects in a
quasi-realistic way are extremely expensive computationally, so a comprehensive survey of the plausible
parameter space occupied by such models has yet to be made. We have no reason to expect that
a particular physical scale will generically emerge as the result of baryonic gastrophysics. Indeed,
feedback from star formation is inherently a random process. While it is certainly possible for
simple laws to emerge from complicated physics (e.g., the fact that SNIa are standard candles
despite the complicated physics involved), the more common situation is for chaos to beget chaos.
It therefore seems unnatural to imagine feedback processes leading to the orderly behavior that
is observed (Figure 10), nor is it obvious how they would implicate any particular physical scale.
Indeed, the dark matter halos formed in ΛCDM simulations [333, 334] provide an initial condition
with greater scatter than the final observed one [280, 479], so we must imagine that the chaotic
processes of feedback not only impart order, but do so in a way that cancels out some of the scatter
in the initial conditions.

In any case, and whatever the reason for it, a physical scale is clearly observationally present
in the data: $a_0$ (Eq. 1). At high accelerations $a \gg a_0$, there is no indication of the need for dark
matter. Below this acceleration, the mass discrepancy appears. It cannot be emphasized enough
that the role played by $a_0$ in the BTFR and this role as a transition acceleration have strictly no
intrinsic link with each other; they are fully independent of each other. There is nothing in ΛCDM
that stipulates that these two relations (the existence of a transition acceleration and the BTFR)
should exist at all, and even less that these should harbour an identical acceleration scale.

It is thus important to realize not only that the relevant dynamical scale is one of acceleration,
not size, but also that the mass discrepancy appears only at extremely low accelerations. Just as
galaxies are much bigger than the Solar system, so too are the centripetal accelerations experienced
by stars orbiting within a galaxy much smaller than those experienced by planets in the Solar
system. Many of the precise tests of gravity that have been made in the Solar system do not explore
the relevant regime of physical parameter space. This is emphasized in Figure 11, which extends
the mass discrepancy-acceleration relation to Solar system scales. Many decades in acceleration
separate the Solar system from galaxies. Aside from the possible exception of the Pioneer anomaly,
there is no hint of a discrepancy in the Solar system: $V = V_b$. Even the Pioneer anomaly is well
removed from the regime where the mass discrepancy manifests in galaxies, and is itself much too
subtle to be perceptible in Figure 11. Indeed, to within a factor of ~ 2, no system exhibits a
mass discrepancy at accelerations $a \gg a_0$.

Note that this correlation with acceleration was looked at notably because it was pointed to by Milgrom's law
(see Sect. 5).

Figure 11: The mass discrepancy-acceleration relation from Figure 10 extended to Solar system
scales (each planet is labelled). This illustrates the large gulf in scale between galaxies and the
Solar system where high precision tests are possible. The need for dark matter only appears at
very low accelerations.
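
The size of the gulf between Solar system and galactic accelerations can be checked with a short
Python sketch. It assumes circular orbits around the Sun, rounded orbital radii, and an assumed
value of $a_0$ (this section does not quote one); all names are illustrative.

```python
G_M_SUN = 1.327e20   # G * M_sun, m^3 s^-2
AU = 1.496e11        # astronomical unit, m
A0 = 1.2e-10         # ASSUMED value of a_0, m s^-2

# Approximate orbital radii in AU (rounded).
PLANETS = {"Mercury": 0.39, "Earth": 1.0, "Jupiter": 5.2, "Neptune": 30.1}


def acceleration_in_a0(r_au):
    """Newtonian acceleration toward the Sun at r_au, in units of a_0."""
    r = r_au * AU
    return G_M_SUN / r**2 / A0


for name, r_au in PLANETS.items():
    print(f"{name:8s} a/a0 = {acceleration_in_a0(r_au):.1e}")
```

Under these assumptions even the outermost planet sits more than four decades above $a_0$, so
planetary orbits probe a very different regime from the outskirts of galaxies.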

The systematic increase in the amplitude of the mass discrepancy with decreasing acceleration
and baryonic surface density has a remarkable implication. Even though the observed velocity is
not correctly predicted by the observed baryons, it is predictable from them. Independently of
any theory, we can simply fit a function $D(G\Sigma)$ to describe the variation of the discrepancy
$(V/V_b)^2$ with baryonic surface density [271]. We can then apply it to any new system we encounter
to predict $V = D^{1/2} V_b$. In effect, $D$ boosts the velocity already predicted by the observed baryons.
While this is a purely empirical exercise with no underlying theory, it is quite remarkable that
the distribution of dark matter required in a galaxy is entirely predictable from the distribution
of its luminous mass (see also [168]). In the conventional picture, dark matter outweighs baryonic
matter by a factor of five, and more in individual galaxies given the halo-by-halo missing baryon
problem (Figure 2), but apparently the baryonic tail wags the dark matter dog. And it does so
again through the acceleration scale $a_0$. Indeed, at very low accelerations, the mass discrepancy
is precisely defined by the inverse of the square-root of the gravitational acceleration generated by
the baryons in units of $a_0$. This actually asymptotically leads to the BTFR.
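
The last step can be made explicit. If $D = (g_N/a_0)^{-1/2}$ at low accelerations, with
$g_N = V_b^2/r$, then $V^2 = D V_b^2 = \sqrt{a_0 g_N}\, r$, and for a mass $M_b$ seen from far enough out that
$g_N = G M_b/r^2$ this gives $V^4 = a_0 G M_b$, independent of radius. The Python sketch below checks
this numerically. The point-mass approximation, the adopted value of $a_0$, and the example galaxy
mass are all assumptions made for illustration.

```python
import math

G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
A0 = 1.2e-10         # ASSUMED value of a_0, m s^-2
M_SUN = 1.989e30     # solar mass, kg
KPC = 3.0857e19      # kiloparsec, m


def predicted_velocity(m_b, r):
    """V = D^(1/2) V_b with D = (g_N / a_0)^(-1/2); valid only for g_N << a_0.

    Treats the baryons as a point mass m_b (kg) at radius r (m).
    """
    g_n = G * m_b / r**2               # Newtonian acceleration from baryons
    v_b_squared = g_n * r              # V_b^2 = g_N * r
    discrepancy = math.sqrt(A0 / g_n)  # D in the low-acceleration limit
    return math.sqrt(discrepancy * v_b_squared)


def btfr_velocity(m_b):
    """Asymptotic velocity from V^4 = a_0 G M_b."""
    return (A0 * G * m_b) ** 0.25


m_b = 1e10 * M_SUN   # hypothetical galaxy
for r_kpc in (50, 100):
    print(r_kpc, predicted_velocity(m_b, r_kpc * KPC) / 1e3, "km/s")
print("BTFR:", btfr_velocity(m_b) / 1e3, "km/s")
```

The velocity returned at the two radii is the same and matches the $V^4 \propto M_b$ form, which is the
sense in which the low-acceleration discrepancy leads to a flat rotation curve and the BTFR.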

So, up to now, we have seen five roles of $a_0$ in galaxy dynamics: (i) it defines the zero point of
the Tully-Fisher relation, (ii) it appears as the characteristic acceleration at the effective radius of
spheroidal systems, (iii) it defines the Freeman limit for the maximum surface density of pure disks,
(iv) it appears as a transition-acceleration above which no dark matter is needed, and below which
it appears, and (v) it defines the amplitude of the mass-discrepancy in the weak-field regime (this
last point is not a fully independent role as it leads to the Tully-Fisher relation). Let us finally
note that there is yet another role played by $a_0$, which is that it defines the central surface density
of all dark matter halos as being of the order of $a_0/(2\pi G)$ [130, 168, 314].

The Pioneer anomaly has an amplitude of the order of ~ $10^{-9}$ m s$^{-2}$ but appears at a location in the solar system
where the total gravitational acceleration is ~ $10^{-6}$ m s$^{-2}$. The discrepancy in Figure 11 is thus $(V/V_b)^2 \approx 1.001$.

4.3.4 Renzo's rule

The relation between dynamical and baryonic surface densities appears as a global scaling relation
in disk galaxies (Figure 9) and as a local correspondence within each galaxy (Figure 10). When all
galaxies are plotted together as in Figure 10, this connection appears as a single smooth function
$D(a)$. This does not suffice to illustrate that individual galaxies have features in their baryon
distribution that are reflected in their dynamics. While the above correlations could be interpreted
as a sort of repulsion between dark and baryonic matter, the following rather indicates closer than
natural attraction.

Figure 12 shows the spiral galaxy NGC 6946. Two multi-color images of the stellar component
are given. The optical bands provide a (nearly) true color picture of the galaxy, which is perceptibly
redder near the center and becomes progressively more blue further out. This is typical of spiral
galaxies and reflects real differences in stellar content: the stars towards the center tend to be older
and more dominated by the light of red giants, while those further out are younger on average so
the light has a greater fractional contribution from bright but short-lived main sequence stars. The
near-infrared bands [210] give a more faithful map of stellar mass, and are less affected by dust
obscuration. Radio synthesis imaging of the 21 cm emission from the hydrogen spin-flip transition
maps the atomic gas in the interstellar medium, which typically extends to rather larger radii than 
the stars. 

Surface density profiles of galaxies are constructed by fitting ellipses to images like those
illustrated in Figure 12. The ellipses provide an axisymmetric representation of the variation of surface
brightness with radius. This is shown in the top panels of Figure 13 for NGC 6946 (Figure 12)
and the nearby, gas rich, low surface brightness galaxy NGC 1560. The K-band light distribution
is thought to give the most reliable mapping of observed light to stellar mass [43], and has been
used to trace the run of stellar surface density in Figure 13. The sharp feature at the center is
a small bulge component visible as the red central region in Figure 12. The bulge contains only
4% of the K-band light. The remainder is the stellar disk; a straight line fit to the data outside
the central bulge region gives the parameters of the exponential disk approximation, $\Sigma_0$ and $R_d$.
Similarly, the surface density of atomic gas is traced by the 21 cm emission, with a correction for
the cosmic abundance of helium - the detected hydrogen represents 75% of the gas mass believed
to be present, with most of the rest being helium, in accordance with big bang nucleosynthesis.

Mass models (bottom panels of Figure 13) are constructed from the surface density profiles
by numerical solution of the Poisson equation [53, 473]. No approximations (like sphericity or
an exponential disk) are made at this step. The disks are assumed to be thin, with radial scale
length exceeding their vertical scale by 8:1, as is typical of edge-on disks [237]. Consequently,
the computed rotation curves (various broken lines in Figure 13) are not smooth, but reflect the
variations in the observed surface density profiles of the various components. The sum
(in quadrature) leads to the total baryonic rotation curve $V_b(r)$ (the solid lines in Figure 13): this
is what would be observed if no dark matter were implicated. Instead, the observed rotation (data
points in Figure 13) exceeds that predicted by $V_b(r)$: this is the mass discrepancy.
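
The bookkeeping described above (helium correction, sum in quadrature, and the resulting mass
discrepancy) can be sketched as follows. This is a minimal illustration, not the procedure used
to build the mass models: the function names and the sample numbers are assumptions, and the
component velocities are taken as given rather than computed from the Poisson equation.

```python
import math


def gas_mass_from_hi(m_hi, hydrogen_fraction=0.75):
    """Total gas mass if the detected hydrogen is 75% of the gas mass."""
    return m_hi / hydrogen_fraction


def baryonic_velocity(v_disk, v_bulge, v_gas):
    """Total baryonic velocity V_b from the components, summed in quadrature."""
    return math.sqrt(v_disk**2 + v_bulge**2 + v_gas**2)


def mass_discrepancy(v_obs, v_b):
    """Mass discrepancy (V / V_b)^2 at a given radius."""
    return (v_obs / v_b) ** 2


# Hypothetical velocities (km/s) at one radius of an imaginary galaxy.
v_b = baryonic_velocity(v_disk=30.0, v_bulge=0.0, v_gas=40.0)
print(v_b, mass_discrepancy(100.0, v_b))
```

Because the sum is in quadrature, a local feature in any one component carries through to $V_b(r)$,
which is why the computed baryonic curves are not smooth.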

It is often merely stated that flat rotation curves require dark matter. But there is considerably
more information in rotation curve data than asymptotic flatness. For example, it is common that
the rotation curve in the inner parts of high surface brightness galaxies like NGC 6946 is well
described by the baryons alone. The data are often consistent with a very low density of dark
matter at small radii with baryons providing the bulk of the gravitating mass. This condition
is referred to as maximum disk [472], and also runs contrary to our inferences of dark matter
dominance from Figure 5 [415]. More generally, features in the baryonic rotation curve $V_b(r)$ often
correspond to features in the total rotation $V_c(r)$.

Perhaps the most succinct empirical statement of the detailed connection between baryons and
dynamics has been given by Renzo Sancisi, and is known as Renzo's rule [380]: "For any feature in the
luminosity profile there is a corresponding feature in the rotation curve." Both galaxies shown
in Figure 13 illustrate this statement. In the inner region of NGC 6946, the small but compact
bulge component causes a sharp feature in $V_b(r)$ that declines rapidly before the rotation curve
rises again as mass from the disk begins to contribute. The up-down-up morphology predicted by
the observed distribution of the baryons is observed in high resolution observations [55, 115]. A
dark matter halo with a monotonically varying density profile cannot produce such a morphology;
the stellar bulge must be the dominant mass component at small radii in this galaxy.

Figure 12: The spiral galaxy NGC 6946 as it appears in the optical (color composite from the
BVR bands, left; image obtained by SSM with Rachel Kuzio de Naray using the Kitt Peak 2.1
m telescope), near-infrared (JHK bands, middle [210]), and in atomic gas (21 cm radiation,
right [482]). The images are shown at the same physical scale, illustrating how the atomic gas
typically extends to greater radii than the stars. Images like these are used to construct mass
models representing the observed distribution of baryonic mass.

A surprising aspect of Renzo's rule is that it applies to low surface brightness galaxies as
well as those of high surface brightness. That the baryons should have some dynamical impact
where their surface density is highest is natural, though there is no reason to demand that they
become competitive with dark matter. What is distinctly unnatural is for the baryons to have
a perceptible impact where dark matter must clearly dominate. NGC 1560 provides an example
where they appear to do just that. The gas distribution in this galaxy shows a substantial kink
in its surface density profile [52] (recently confirmed by [164]) that has a distinct impact on $V_b(r)$.
This occurs at a radius where $V \gg V_b$, so dark matter should be dominant. A spherical dark
matter halo with particles on randomly oriented, highly radial orbits cannot support the same sort
of structure as seen in the gas disk, and the spherical geometry, unlike a disk geometry, would
smear the effect on the local acceleration. And yet the wiggle in the baryonic rotation curve is
reflected in the total, as per Renzo's rule.

One inference that might be made from these observations is that the dark matter is baryonic. 
This is unacceptable from a cosmological perspective, but it is possible to have a multiplicity of 
dark matter components. That is, we could have baryonic dark matter in the disks of galaxies in 

Note that such wiggles are often associated with spiral arm features (the existence of which in LSB galaxies
is itself challenging in the presence of a massive dark matter halo, see Sect. 4.2), and hence with
non-circular motions. It is conceivable that the observed wiggles are partly due to these, but the effect of local
density contrasts due to spiral arms on the tangential velocity should be damped by the global effect of the spherical
dark matter halo, which is apparently not the case.

Figure 13: Surface density profiles (top) and rotation curves (bottom) of two galaxies: the high
surface brightness spiral NGC 6946 (Figure 12, left) and the low surface brightness galaxy NGC
1560 (right). The surface density of stars (blue circles) is estimated by azimuthal averaging in
ellipses fit to the K'-band (2.2 μm) light distribution. Similarly, the gas surface density (green
circles) is estimated by applying the same procedure to the 21 cm image. Note the different scale
between low and high surface brightness galaxies. Also note features like the central bulge of
NGC 6946, which corresponds to a sharp increase in stellar surface density at small radius. In the
lower panels, the observed rotation curves (data points) are shown together with the baryonic mass
models (lines) constructed from the observed distribution of baryons. Velocity data for NGC 6946
include both HI data that define the outer, flat portion of the rotation curve [67] and Hα data from
two independent observations [55, 115] that define the shape of the inner rotation curve. Velocity
data for NGC 1560 come from two independent interferometric HI observations [29, 164]. Baryonic
mass models are constructed from the surface density profiles by numerical solution of the Poisson
equation using GIPSY [473]. The dashed blue line is the stellar disk, the red dot-dashed line is the
central bulge, and the green dotted line is the gas. The solid black line is the sum of all baryonic
components. This provides a decent match to the rotation curve at small radii in the high surface
brightness galaxy, but fails to explain the flat portion of the rotation curve at large radii. This
discrepancy, and its systematic ubiquity in spiral galaxies, ranks as one of the primary motivations
for dark matter. Note that the mass discrepancy is large at all radii in the low surface brightness
galaxy.

addition to a halo of non-baryonic cold dark matter. It is often possible to scale up the atomic
gas component to fit the total rotation [194]. That implies a component of mass that is traced
by the atomic gas - presumably some other dynamically cold gas component - that outweighs the
observed hydrogen by a factor of six to ten [194]. One hypothesis for such a component is very
cold molecular gas [353]. It is difficult to exclude such a possibility, though it also appears to be
hard to sustain in LSB galaxies [293]. Dynamically, one might expect the extra mass to destabilize
the LSB disk. One also returns to a fine-tuning between baryonic surface density and mass-to-light
ratio. In order to maintain the balance observed in Figure 5, relatively more dark molecular gas
will be required in lower surface brightness galaxies so as to maintain a constant surface density of
gravitating mass, but given the interactions at hand, this might be at least a bit more promising
than explaining it with CDM halos.

As a matter of fact, low surface brightness galaxies play a critical role in testing many of 
the existing models for dark matter. This happens in part because they were appreciated as 
an important population of galaxies only after many relevant hypotheses were established, and 
thus provide good tests of their a priori expectations. Observationally, we infer that low surface 
brightness disks exhibit large mass discrepancies down to small radii [120]. Conventionally, this
means that dark matter completely dominates their dynamics: the surface density of baryons
in these systems is never high enough to be relevant. Nevertheless, the observed distribution of
baryons suffices to predict the total rotation [279, 121]. Once again, the baryonic tail wags the
dark matter dog, with the observations of the minority baryonic component sufficing to predict
the distribution of the dominant dark matter. Note that, conversely, nothing is "observable" about
the dark matter, in present-day simulations, that predicts the distribution of baryons.

We thus see that there are many observations, mostly on galaxy scales, that are unpredicted,
and perhaps unpredictable, in the standard dark matter context. They mostly involve a unique
relationship between the distribution of baryons and the gravitational field, as well as an acceleration
constant $a_0$ of the order of the square-root of the cosmological constant, and they represent
the most significant challenges to the current ΛCDM model.

5 Milgrom's Empirical Law and "Kepler Laws" of Galactic
Dynamics

Up to this point in this review, the challenges that we have presented have been purely based on
observations, and fully independent of any alternative theoretical framework. However, at this
point, it would obviously be a step forward if at least some of these puzzling observations could 
be summarized and empirically unified in some way, as such a unifying process is largely what 
physics is concerned with, rather than simply exposing a jigsaw of apparently unrelated empirical 
observations. And such an empirical unification is actually feasible for many of the unpredicted 
observations presented in the previous section, and goes back to a rather old idea of the Israeli 
physicist Mordehai Milgrom. 

Almost 30 years ago, back in 1983 (and thus before most of the aforementioned observations
had been carried out), simply prompted by the question of whether the missing mass problem
could perhaps reflect a breakdown of Newtonian dynamics in galaxies, Milgrom [294] devised a
formula linking the Newtonian gravitational acceleration $g_N$ to the true gravitational acceleration
$g$ in galaxies. Such attempts to rectify the mass discrepancy by gravitational means often begin
by noting that galaxies are much larger than the Solar system. It is easy to imagine that at
some suitably large scale, let's say of the order of 1 kpc, there is a transition from the usual
dynamics applicable in the comparatively tiny Solar system to some more general theory that
applies on the scale of galaxies in order to explain the mass discrepancy problem. If so, we would
expect the mass discrepancy to manifest itself at a particular length scale in all systems. However,
as already noted above, there is no universal length scale apparent in the data (Figure 10;
[402, 267, 407, 279, 271]). The mass discrepancy appears already at small radii in some galaxies; in
others there is no apparent need for dark matter until very large radii. This now observationally
excludes all hypotheses that simply alter the force law at a linear length-scale.

5.1 Milgrom's law and the dielectric analogy 

Before such precise data were available, Milgrom [294] already noted that other scales were also
possible, and that one that is as unique to galaxies as size is acceleration. The typical centripetal
acceleration of a star in a galaxy is of order $\sim 10^{-10}\,\mathrm{m\,s^{-2}}$. This is eleven orders of magnitude less
than the surface gravity of the Earth. As we have seen in the previous section, this acceleration
constant appears "miraculously" in very different scaling relations that should in principle not
be related with each other. This universal appearance of $a_0 \simeq 10^{-10}\,\mathrm{m\,s^{-2}}$
in galactic scaling relations was not at all observationally evident back in 1983. What
Milgrom [294] then hypothesized was a modification of Newtonian dynamics below this acceleration
constant $a_0$, appropriate to the tiny accelerations encountered in galaxies. This new constant $a_0$
would then play a similar role as the Planck constant $h$ in quantum physics or the speed of light
$c$ in special relativity. For large acceleration (or force per unit mass), $F/m = g \gg a_0$, everything
would be normal and Newtonian, i.e., $g = g_N$. Or, put differently, formally taking $a_0 \to 0$ should
make the theory tend to standard physics, just like recovering classical mechanics for $h \to 0$. On
the other hand, formally taking $a_0 \to \infty$ (and $G \to 0$), or equivalently, in the limit of small
accelerations $g \ll a_0$, the modification would apply in the form:

$$ g = \sqrt{g_N\, a_0}, \tag{4} $$

Note that many of these relations were scrutinized during the last 30 years because they were pointed to by
Milgrom's law. This law thus already achieved an important role of a theoretical idea, i.e., to point to and direct
observations and their arrangement.

Of course, there is also a natural length scale associated with this acceleration constant, $l = c^2/a_0$, but this
length scale will enter the modification nonlinearly, and is thus not the length at which the modification would be
seen in galaxies, as it is rather of the order of the Hubble radius.

where $g = |\mathbf{g}|$ is the true gravitational acceleration, and $g_N = |\mathbf{g}_N|$ the Newtonian one as calculated
from the observed distribution of visible matter. Note that this limit follows naturally from the
scale-invariance symmetry of the equations of motion under transformations $(t, \mathbf{r}) \to (\lambda t, \lambda \mathbf{r})$ [316].
This particular modification was only suggested in 1983 by the asymptotic flatness of rotation
curves and the slope of the Tully-Fisher relation. It is indeed trivial to see that the desired
behavior follows from Eq. (4). For a test particle in circular motion around a point mass
$M$, equilibrium between the radial component of the force and the centripetal acceleration yields
$V_c^2/r = g_N = GM/r^2$. In the weak-acceleration limit this becomes

$$ \frac{V_c^2}{r} = \sqrt{\frac{G M a_0}{r^2}}. \tag{5} $$

The terms involving the radius $r$ cancel, simplifying to

$$ V_c^4(r) = V_f^4 = a_0 G M. \tag{6} $$

The circular velocity no longer depends on radius, asymptoting to a constant $V_f$ that depends only
on the mass of the central object and fundamental constants. The equation above is the equivalent 
of the observed baryonic Tully-Fisher relation. It is often wrongly stated that Milgrom's formula 
was constructed in an ad hoc way in order to reproduce galaxy rotation curves, while this statement 
is only true of these two observations: (i) the asymptotic flatness of the rotation curves, and (ii) the 
slope of the baryonic Tully-Fisher relation (but note that, at the time, it was not clear at all that 
this slope would hold, nor that the Tully-Fisher relation would correlate with baryonic mass rather 
than luminosity, and even less clear that it would hold over orders of magnitude in mass). All the 
other successes of Milgrom's formula related to the phenomenology of galaxy rotation curves were 
pure predictions of the formula made before the observational evidence. The predictions that are 
encapsulated in this simple formula can be thought of as sort of "Kepler-like laws" of galactic 
dynamics. These various laws only make sense once they are unified within their parent formula, 
exactly as Kepler's laws only make sense once they are unified under Newton's law. 
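
As a numerical illustration of Eq. (6), the following minimal sketch evaluates the asymptotic velocity of a point mass and inverts the relation to recover the baryonic mass. The value $a_0 = 1.2 \times 10^{-10}\,\mathrm{m\,s^{-2}}$ is an assumption made here for concreteness (the text only quotes the order of magnitude), and the function names are hypothetical.

```python
G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
A0 = 1.2e-10       # acceleration constant in m s^-2 (assumed value)
M_SUN = 1.989e30   # solar mass, kg


def flat_velocity_kms(m_baryon_msun, a0=A0):
    """Asymptotic circular speed V_f = (G M a0)^(1/4) of Eq. (6), in km/s."""
    return (G * m_baryon_msun * M_SUN * a0) ** 0.25 / 1e3


def btfr_mass_msun(v_flat_kms, a0=A0):
    """Baryonic mass M = V_f^4 / (G a0) implied by Eq. (6), in solar masses."""
    return (v_flat_kms * 1e3) ** 4 / (G * a0) / M_SUN


# Four decades in mass correspond to one decade in velocity (slope 4).
for mass in (1e8, 1e10, 1e12):
    print(f"M_b = {mass:.0e} Msun -> V_f = {flat_velocity_kms(mass):.0f} km/s")
```

With these assumed constants, a baryonic mass of $10^{10}\,M_\odot$ corresponds to a flat velocity of roughly 112 km/s, and the radius never enters the calculation.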

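In order to ensure a smooth transition between the two regimes $g \gg a_0$ and $g \ll a_0$, Milgrom's
law is written in the following way:

$$ \mu\!\left(\frac{g}{a_0}\right) \mathbf{g} = \mathbf{g}_N, \tag{7} $$

where the interpolating function

$$ \mu(x) \to 1 \ \text{for} \ x \gg 1 \quad \text{and} \quad \mu(x) \to x \ \text{for} \ x \ll 1. \tag{8} $$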

Written like this, the analogy between Milgrom's law and Coulomb's law in a dielectric medium
is clear, as noted in [57]. Indeed, inside a dielectric medium, the amplitude of the electric field $E$
generated by an external point charge $Q$ located at a distance $r$ obeys the following equation:

$$ \mu(E)\, E = \frac{Q}{4 \pi \epsilon_0 r^2}, \tag{9} $$

where $\mu$ is the relative permittivity of the medium, and can depend on $E$. In the case of a
gravitational field generated by a point mass $M$, it is then clear that Milgrom's interpolating
function plays the role of "gravitational permittivity". Since it is smaller than 1, it makes the
gravitational field stronger than Newtonian (rather than smaller in the case of the electric field in
a dielectric medium, where $\mu > 1$). In other words, the gravitational susceptibility coefficient $\chi$
(such that $\mu = 1 + \chi$) is negative, which is correct for a force law where like masses attract rather
than repel [57]. This dielectric analogy has been explicitly used in devising a theory [HI] where
Milgrom's law arises from the existence of a "gravitationally polarizable" medium (see Sect. 7).

Of course, inverting the above relation, Milgrom's law can also be written as

$$ \mathbf{g} = \nu\!\left(\frac{g_N}{a_0}\right) \mathbf{g}_N, \tag{10} $$

where

$$ \nu(y) \to 1 \ \text{for} \ y \gg 1 \quad \text{and} \quad \nu(y) \to y^{-1/2} \ \text{for} \ y \ll 1. \tag{11} $$

However, as we shall see in the next section, in order for $\mathbf{g}$ to remain a conservative force field,
these expressions (Eqs. 7 and 10) cannot be rigorous outside of highly symmetrical situations. Milgrom's law
nevertheless allows one to make numerous very general predictions for galactic systems, or in other
words, to derive "Kepler-like laws" of galactic dynamics, unified under the banner of Milgrom's law.
As we shall see, many of the observations unpredicted by ΛCDM on galaxy scales naturally ensue
from this very simple law. However, even though Milgrom originally devised this as a modification
of dynamics, this law is a priori nothing more than an algorithm which allows one to calculate the
distribution of force in an astronomical object from the observed distribution of baryonic matter. 
Its success would simply mean that the observed gravitational field in galaxies is mimicking a 
universal force law generated by the baryons alone, meaning that (i) either the force law itself is 
modified, or that (ii) there exists an intimate connection between the distribution of baryons and 
dark matter in galaxies. 
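
To make the "algorithm" reading of Milgrom's law concrete, here is a minimal sketch that maps a Newtonian acceleration to the true one. It assumes one particular interpolating function, $\mu(x) = x/(1+x)$, whose inverse is $\nu(y) = 1/2 + \sqrt{1/4 + 1/y}$. This choice is an assumption made for illustration: the text above only fixes the two asymptotic limits, and the value of $a_0$ and the function names are likewise hypothetical.

```python
import math

A0 = 1.2e-10  # acceleration constant in m s^-2 (assumed value)


def mu_simple(x):
    """Assumed interpolating function: -> 1 for x >> 1 and -> x for x << 1."""
    return x / (1.0 + x)


def nu_simple(y):
    """Inverse form of mu_simple: -> 1 for y >> 1 and -> y**-0.5 for y << 1."""
    return 0.5 + math.sqrt(0.25 + 1.0 / y)


def true_acceleration(g_newton, a0=A0):
    """Milgrom's law in its inverted form, g = nu(g_N / a0) * g_N."""
    return nu_simple(g_newton / a0) * g_newton


# Sweep from the Newtonian regime down to the weak-acceleration regime.
for g_n in (1e-8, 1e-10, 1e-12, 1e-14):
    g = true_acceleration(g_n)
    print(f"g_N = {g_n:.0e}  g = {g:.3e}  g/g_N = {g / g_n:.2f}  "
          f"sqrt(g_N a0) = {math.sqrt(g_n * A0):.3e}")
```

The ratio g/g_N printed in the loop is the mass discrepancy that would be inferred under Newtonian dynamics; it stays close to 1 at high acceleration and grows as the acceleration drops below $a_0$.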

It was for instance suggested [219] that such a relation might arise naturally in the CDM
context, if halos possess a one-parameter density profile that leads to a characteristic acceleration
profile that is only weakly dependent upon the mass of the halo. Then with a fixed collapse
factor for the baryonic material, the transition from dominance of dark over baryonic occurs at a
universal acceleration, which by numerical coincidence, is of the order of $cH_0$ and thus of $a_0$ (see
also [412]). While, still today, it remains to be seen whether this scenario would quantitatively hold
in numerical simulations, it was noted by Milgrom [307] that this scenario only explained the role
of $a_0$ as a transition radius between baryon and dark matter dominance in high-surface brightness
(HSB) galaxies, precluding altogether the existence of low-surface brightness (LSB) galaxies where
dark matter dominates everywhere. The real challenge for ΛCDM is rather to explain all the
different roles played by $a_0$ in galaxy dynamics, different roles that can all be summarized within
the single law proposed by Milgrom, just like Kepler's laws are unified under Newton's law. We list
these Kepler-like laws of galactic dynamics hereafter, and relate each of them with the unpredicted
observations of Sect. 4, keeping in mind that these were mostly a priori predictions of Milgrom's
law, made before the data were as good as today, not "postdictions" like we are used to in modern
cosmology.
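
The numerical coincidence between $a_0$ and $cH_0$ mentioned above can be checked in a few lines. This is only a sketch: the Hubble constant of 70 km/s/Mpc and the value of $a_0$ are assumptions, and the function name is hypothetical.

```python
import math

C = 2.998e8      # speed of light, m/s
MPC = 3.0857e22  # one megaparsec, m
A0 = 1.2e-10     # acceleration constant in m s^-2 (assumed value)


def c_times_h0(h0_kms_per_mpc=70.0):
    """Return c * H0 in m s^-2 for a Hubble constant given in km/s/Mpc."""
    h0_si = h0_kms_per_mpc * 1e3 / MPC  # H0 in s^-1
    return C * h0_si


ch0 = c_times_h0()
print(f"c H0        = {ch0:.2e} m/s^2")
print(f"c H0 / a0   = {ch0 / A0:.1f}")
print(f"2 pi a0/cH0 = {2 * math.pi * A0 / ch0:.2f}")
```

With these inputs, $cH_0$ exceeds $a_0$ by a factor of a few, so the two agree in order of magnitude. The same ratio relates the length $c^2/a_0$ to the Hubble radius $c/H_0$.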

5.2 Galactic Kepler-like laws of motion 

1. Asymptotic flatness of rotation curves. The rotation curves of galaxies are asymptotically
flat, even though this flatness is not always attained at the last observed point (see the
point hereafter about the shapes of rotation curves as a function of baryonic surface density).
What is more, Milgrom's law can be thought of as including the total acceleration with respect
to a preferred frame, which can lead to the prediction of asymptotically falling rotation
curves for a galaxy embedded in a large external gravitational field (see Sect. 6.3).

2. $G a_0$ defining the zero-point of the baryonic Tully-Fisher relation. The plateau of
a rotation curve is $V_f = (G M a_0)^{1/4}$. The true Tully-Fisher relation is predicted to be a
relation between this asymptotic velocity and baryonic mass, not luminosity. Milgrom's law
yields immediately the slope (precisely 4) and zero-point of this baryonic Tully-Fisher law.
The observational baryonic Tully-Fisher relation should thus be consistent with zero scatter
around this prediction of Milgrom's law (the dotted line of Figure 3). And indeed it is. All
rotationally supported systems in the weak acceleration limit should fall on this relation,
irrespective of their formation mechanism and history, meaning that completely isolated
galaxies or tidal dwarf galaxies formed in interaction events all behave as every other galaxy
in this respect.

3. $G a_0$ defining the zero-point of the Faber-Jackson relation. For quasi-isothermal
systems [297], such as elliptical galaxies, the bulk velocity dispersion depends only on the
total baryonic mass via $\sigma^4 \sim G M a_0$. Indeed, since the equation of hydrostatic equilibrium for
an isotropic isothermal system in the weak field regime reads $d(\sigma^2 \rho)/dr = -\rho\,(G M a_0)^{1/2}/r$,
one has $\sigma^4 = \alpha^{-2} \times G M a_0$ where $\alpha = d\ln\rho/d\ln r$. This underlies the Faber-Jackson relation
for elliptical galaxies (Figure 7), which is however not predicted by Milgrom's law to be as
tight and precise (because it relies, e.g., on isothermality and on the slope of the density
distribution) as the BTFR.

4. Mass discrepancy defined by the inverse of the acceleration in units of $a_0$. Or
alternatively, defined by the inverse of the square-root of the gravitational acceleration
generated by the baryons in units of $a_0$. The mass discrepancy is precisely equal to this in
the very low-acceleration regime, and leads to the baryonic Tully-Fisher relation. In the
low-acceleration limit, $g_N/g = g/a_0$, so in the CDM language, inside the virial radius of any
system whose virial radius is in the weak acceleration regime (well below $a_0$), the baryon
fraction is given by the acceleration in units of $a_0$. If we adopt a rough relation $M_{500} \simeq
1.5 \times 10^5\,M_\odot \times V_c^3\,(\mathrm{km/s})^{-3}$, we get that the acceleration at $R_{500}$, and thus the system baryon
fraction predicted by Milgrom's formula, is $M_b/M_{500} = a_{500}/a_0 \simeq 4 \times 10^{-4} \times V_c\,(\mathrm{km/s})^{-1}$.
Divided by the cosmological baryon fraction, this explains the trend for $f_d = M_b/(0.17\,M_{500})$
with potential ($\Phi = V_c^2$) in Figure 2, thereby naturally explaining the halo-by-halo missing
baryon challenge in galaxies. No baryons are actually missing; rather, we infer their existence
because the natural scaling between mass and circular velocity $M_{500} \propto V_c^3$ in ΛCDM differs
by a factor of $V_c$ from the observed scaling $M_b \propto V_c^4$.

5. $a_0$ as the characteristic acceleration at the effective radius of isothermal spheres.
As a corollary to the Faber-Jackson relation for isothermal spheres, let us note that the
baryonic isothermal sphere would not require any dark matter up to the point where the internal
gravity falls below $a_0$, and would thus resemble a purely baryonic Newtonian isothermal
sphere up to that point. But at larger distances, in the presence of the added force due to
Milgrom's law, the baryonic isothermal sphere would rather fall [297] as $r^{-4}$, thereby making
the radius at which the gravitational acceleration is $a_0$ the effective baryonic radius of the
system. This explains why, at this radius $R$ in quasi-isothermal systems, the typical
acceleration $\sigma^2/R$ is almost always observed to be of the order of $a_0$. Of course, this is
valid for systems where such a transition radius does exist, but going to very low surface
brightness systems, if the internal gravity is everywhere below $a_0$, one can then have typical
accelerations as low as one wishes.

6. $a_0/G$ as a critical mean surface density for stability. Disks with mean surface density
$\langle\Sigma\rangle \leq \Sigma_\dagger = a_0/G$ have added stability. Most of the disk is then in the weak-acceleration
regime, where accelerations scale as $a \propto \sqrt{M}$, instead of $a \propto M$. Thus $\delta a/a = (1/2)\,\delta M/M$
instead of $\delta a/a = \delta M/M$, leading to a weaker response to small mass perturbations [300].
This explains the Freeman limit (Figure 8).

7. $a_0$ as a transition acceleration. The mass discrepancy in galaxies always appears
(transition from baryon dominance to dark matter dominance) when $V_c^2/R \sim a_0$, yielding a clear
mass-discrepancy acceleration relation (Figure 10). This, again, is the case for every single
rotationally supported system irrespective of its formation mechanism and history. For
HSB galaxies, where there exist two distinct regions with $V_c^2/R > a_0$ in the inner parts
and $V_c^2/R < a_0$ in the outer parts, locally measured mass-to-light ratios should show no
indication of hidden mass in the inner parts, but rise beyond the radius where $V_c^2/R \approx a_0$
(Figure 14). Note that this is the only role of $a_0$ that the scenario of [219] was poorly trying
to address (forgetting, e.g., about the existence of LSB galaxies).

8. $a_0/G$ as a transition central surface density. The acceleration $a_0$ defines the transition
from HSB galaxies to LSB galaxies: baryons dominate in the inner parts of galaxies whose
central surface density is higher than some critical value of the order of $\Sigma_\dagger = a_0/G$, while in
galaxies whose central surface density is much smaller (LSB galaxies), DM dominates everywhere,
and the magnitude of the mass discrepancy is given by the inverse of the acceleration
in units of $a_0$, see (5) below. The mass discrepancy thus appears at smaller radii and is more
severe in galaxies of lower baryonic surface densities (Figure 14). The shapes of rotation
curves are predicted to depend on surface density: HSB galaxies are predicted to have rotation
curves that rise steeply then become flat, or even fall somewhat to the not-yet-reached
asymptotic flat velocity, while LSB galaxies are supposed to have rotation curves that rise
slowly to the asymptotic flat velocity. This is precisely what is observed (Figure 15), and
is in accordance [163] with the more complex empirical parametrization of observed rotation
curves that has been proposed in [377]. Finally the total (baryons+DM) acceleration is
predicted to decline with the mean baryonic surface density of galaxies, exactly as observed
(Figure 16), in the form $a \propto \Sigma_b^{1/2}$ (see also Figure 9).

9. $a_0/(2\pi G)$ as the central surface density of dark halos. Provided they are mostly
in the Newtonian regime, galaxies are predicted to be embedded in dark halos (whether
real or virtual, i.e., "phantom" dark matter) with a central surface density of the order of
$a_0/(2\pi G)$ as observed. LSBs should have a halo surface density scaling as the square-root
of the baryonic surface density, in a much more compressed range than for the HSB ones,
explaining the consistency of observed data with a constant central surface density of dark
matter [168, 314].

10. Features in the baryonic distribution imply features in the rotation curve. Because
a small variation in $g_N$ will be directly translated into a similar one in $g$, Renzo's rule
(Sect. 4.3.4) is explained naturally.
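
The arithmetic behind the fourth law above can be reproduced with a short sketch. It combines $M_b = V_c^4/(G a_0)$ with the rough relation $M_{500} \simeq 1.5 \times 10^5\,M_\odot\,V_c^3$ quoted in the text. The value of $a_0$ is an assumption, and all function names are hypothetical.

```python
G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
A0 = 1.2e-10       # acceleration constant in m s^-2 (assumed value)
M_SUN = 1.989e30   # solar mass, kg


def baryonic_mass_msun(v_c_kms, a0=A0):
    """Baryonic mass M_b = V_c^4 / (G a0), in solar masses."""
    return (v_c_kms * 1e3) ** 4 / (G * a0) / M_SUN


def m500_msun(v_c_kms):
    """Rough CDM scaling quoted in the text: M_500 ~ 1.5e5 Msun * V_c^3."""
    return 1.5e5 * v_c_kms ** 3


def baryon_fraction(v_c_kms, a0=A0):
    """Predicted M_b / M_500, which grows linearly with V_c."""
    return baryonic_mass_msun(v_c_kms, a0) / m500_msun(v_c_kms)


def detected_fraction(v_c_kms, cosmic_fraction=0.17):
    """f_d = M_b / (0.17 M_500), relative to the cosmic baryon fraction."""
    return baryon_fraction(v_c_kms) / cosmic_fraction


for v_c in (30.0, 100.0, 300.0):
    print(f"V_c = {v_c:5.0f} km/s  M_b/M_500 = {baryon_fraction(v_c):.3f}  "
          f"f_d = {detected_fraction(v_c):.2f}")
```

For $V_c = 100$ km/s this gives $M_b/M_{500}$ of about 0.04, in line with the coefficient $4 \times 10^{-4}$ per km/s stated in the fourth law.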

As a conclusion, all the apparently independent roles that the characteristic acceleration $a_0$
plays in the unpredicted observations of Sect. 4.3 (see end of Sect. 4.3.3 for a summary), as well as
Renzo's rule (Sect. 4.3.4), have been elegantly unified by the single law proposed by Milgrom [294]
in 1983 as a unique scaling relation between the gravitational field generated by observed baryons
and the total observed gravitational force in galaxies.
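
The characteristic scales appearing in these laws follow directly from $a_0$ and $G$. The sketch below converts $\Sigma_\dagger = a_0/G$ and $a_0/(2\pi G)$ to solar masses per square parsec, and evaluates the radius at which the Newtonian acceleration of a point mass drops to $a_0$. The point-mass idealization, the value of $a_0$, and the function names are assumptions made for illustration.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
A0 = 1.2e-10       # acceleration constant in m s^-2 (assumed value)
M_SUN = 1.989e30   # solar mass, kg
PC = 3.0857e16     # one parsec, m
KPC = 1e3 * PC     # one kiloparsec, m


def sigma_dagger_msun_pc2(a0=A0):
    """Critical surface density a0 / G, in Msun per pc^2."""
    return a0 / G * PC ** 2 / M_SUN


def halo_central_sigma_msun_pc2(a0=A0):
    """Characteristic halo surface density a0 / (2 pi G), in Msun per pc^2."""
    return sigma_dagger_msun_pc2(a0) / (2.0 * math.pi)


def transition_radius_kpc(mass_msun, a0=A0):
    """Radius where G M / r^2 = a0 for a point mass M, in kpc."""
    return math.sqrt(G * mass_msun * M_SUN / a0) / KPC


print(f"a0/G         = {sigma_dagger_msun_pc2():.0f} Msun/pc^2")
print(f"a0/(2 pi G)  = {halo_central_sigma_msun_pc2():.0f} Msun/pc^2")
print(f"r(g_N = a0)  = {transition_radius_kpc(1e11):.1f} kpc for 1e11 Msun")
```

With the assumed constants, $a_0/G$ comes out at several hundred solar masses per square parsec, and a $10^{11}\,M_\odot$ point mass reaches $g_N = a_0$ at about ten kiloparsecs.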



Note that the denominator $2\pi G$ comes from integrating the phantom dark matter density along a vertical line
as per [314], which leads to a slightly smaller characteristic surface density for phantom dark matter than the $\Sigma_\dagger$
defining the Freeman limit in the 6th law above.

Figure 14: The mass discrepancy (as in Figure 10) as a function of radius in observed spiral
galaxies. The curves for individual galaxies (lines) are color-coded by their characteristic baryonic
surface density (as in Figure 5). In order to be completely empirical and fully independent of any
assumption such as maximum disk, stellar masses have been estimated with population synthesis
models [43]. The amplitude of the mass discrepancy is initially small in high surface density
galaxies, and grows only slowly at large radii. As the baryonic surface densities of galaxies decline,
the mass discrepancy becomes more severe and appears at smaller radii. This trend confirms one
of the a priori predictions of Milgrom's law [295].

Figure 15: The shapes of observed rotation curves depend on baryonic surface density (color coding
as per Figure 14). High surface density galaxies have rotation curves that rise steeply then become
flat, or even fall somewhat to the asymptotic flat velocity. Low surface density galaxies have
rotation curves that rise slowly to the asymptotic flat velocity. This trend confirms one of the a
priori predictions of Milgrom's law [295].

Figure 16: Centripetal acceleration as a function of radius and surface density (color coding as per
Figure 14). The critical acceleration $a_0$ is denoted by the dotted line. Milgrom's formula predicts
that acceleration should decline with baryonic surface density, as observed. Moreover, high surface
density galaxies transition from the Newtonian regime at small radii to the weak-field regime at
large radii, whereas low surface density galaxies fall entirely in the regime of low acceleration
$a < a_0$, as anticipated by Milgrom [295].

6 Milgrom's Law as a Modification of Classical Dynamics:
MOND



It thus appears that many puzzling observations, that are difficult to understand in the ΛCDM
context (and/or require an extreme fine-tuning of the DM distribution), are well summarized by a
single heuristic law. It would therefore appear natural that this law derives from a universal force
law, and would reflect a modification of dynamics rather than the addition of massive particles
interacting (almost) only gravitationally with baryonic matter. However, applying Eq. 7 blindly
to a set of massive bodies directly leads to serious problems [151, 294] such as the non-conservation
of momentum. In a two-body configuration, as the implied force is not symmetric in the two
masses, Newton's third law (action and reaction principle) does not hold, so the momentum is
not conserved. Consider a translationally invariant isolated system of two such masses $m_1$ and
$m_2$ small enough to be in the very weak acceleration limit, and placed at rest on the x-axis. The
amplitude of the Newtonian force is then $F_N = G m_1 m_2/(x_2 - x_1)^2$, and applying Eq. 7 blindly
would lead to individual accelerations $|a_i| = \sqrt{F_N a_0/m_i}$. This then immediately leads to

$$ \dot{p} = m_1 a_1 + m_2 a_2 = \sqrt{F_N a_0}\,\left(\sqrt{m_1} - \sqrt{m_2}\right) \neq 0, \tag{12} $$

meaning that for different masses, the momentum of this isolated system is not conserved. This
thus means that Eq. 7 cannot truly represent a universal force law. If Eq. 7 is to be more than
just a heuristic law summarizing how dark matter is arranged in galaxies with respect to baryonic
matter, it must then be an approximation (valid only in highly symmetric configurations) of a
more general force law deriving from an action and a variational principle. Such theories at the
classical level can be classified under the acronym MOND, for Modified Newtonian Dynamics.
In this section, we sketch how to devise such theories at the classical level, and list detailed tests
of these theories at all astrophysical scales.
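
The momentum problem described above can be seen numerically. The sketch below applies the weak-acceleration form of Milgrom's law separately to each of two bodies at rest on the x-axis and sums the resulting rates of change of momentum. The masses, separation, value of $a_0$, and function names are all illustrative assumptions. The masses are chosen so that both Newtonian accelerations are far below $a_0$.

```python
import math

G = 6.674e-11   # gravitational constant, m^3 kg^-1 s^-2
A0 = 1.2e-10    # acceleration constant in m s^-2 (assumed value)


def naive_momentum_rate(m1, m2, separation, a0=A0):
    """Sum m1*a1 + m2*a2 when |a_i| = sqrt(F_N a0 / m_i) is applied per body.

    Body 1 sits at smaller x than body 2, so a1 points along +x and a2
    along -x. Only valid when both Newtonian accelerations are << a0.
    """
    f_newton = G * m1 * m2 / separation ** 2
    a1 = +math.sqrt(f_newton * a0 / m1)
    a2 = -math.sqrt(f_newton * a0 / m2)
    return m1 * a1 + m2 * a2


m1, m2, d = 1.0e30, 4.0e30, 1.0e17  # kg, kg, m (illustrative values)
print("largest g_N / a0:", G * max(m1, m2) / d ** 2 / A0)
print("dp/dt, unequal masses:", naive_momentum_rate(m1, m2, d))
print("dp/dt, equal masses:  ", naive_momentum_rate(m1, m1, d))
```

Only for equal masses does the total momentum stay constant. For unequal masses the isolated pair would accelerate as a whole, which is why the heuristic law must be derived from an action instead of being applied body by body.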

6.1 Modified inertia or modified gravity: Non-relativistic actions 

If one wants to modify dynamics in order to reproduce Milgrom's heuristic law while still benefiting 
from usual conservation laws such as the conservation of momentum, one can start from the action 
at the classical level. Clearly such theories are only toy-models until they become the weak-field 
limit of a relativistic theory (see Sect. 7), but they are useful both as targets for such relativistic 
theories, and as internally consistent models allowing to make predictions at the classical level (i.e., 
neither in the relativistic or quantum regime). 

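A set of particles of mass $m_i$ moving in a gravitational field generated by the matter density
distribution $\rho = \sum_i m_i\,\delta(\mathbf{x} - \mathbf{x}_i)$ and described by the Newtonian potential $\Phi_N$ has the following
action:

$$ S_N = S_{\rm kin} + S_{\rm in} + S_{\rm grav} = \int \frac{\rho v^2}{2}\, d^3x\, dt - \int \rho\, \Phi_N\, d^3x\, dt - \int \frac{|\nabla \Phi_N|^2}{8 \pi G}\, d^3x\, dt. \tag{13} $$
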
Note that the main motivation for modifying dynamics is thus not to get rid of DM, but to explain why the
observed gravitational field in galaxies is apparently mimicking a universal force law generated by the baryons alone.
The simplest explanation is of course a priori not that DM arranges itself by chance to mimic this force law, but
rather that the force law itself is modified. Note that at a fundamental level, relativistic theories of modified gravity 
often will have to include new fields to reproduce this force law, so that dark matter is effectively replaced by "dark 
fields" in these theories, or even by dark matter exhibiting a new interaction with baryons (one could speak of "dark 
matter" if the stress-energy tensor of the new fields is numerically comparable to the density of baryons) : this makes 
the confrontation between modified gravity and dark matter less clear than often believed. The actual confrontation 
is rather that between all sorts of theories embedding the phenomenology of Milgrom's law vs. theories of DM 
made of simple self-uninteracting billiard balls assembling themselves in galactic halos under the sole influence of 
unmodified gravity, theories which currently appear unable to explain the observed phenomenology of Milgrom's 
law. 

Generally covariant theories approaching these classical theories in the weak-field limit will then also be classified 
under this same MOND acronym, even if they really are Modified Einsteinian Dynamics (see Sect. 7) 

Varying this action with respect to configuration space coordinates yields the equations of motion
$d^2\mathbf{x}/dt^2 = -\nabla\Phi_N$, while varying it with respect to the potential leads to the Poisson equation
$\nabla^2\Phi_N = 4\pi G\rho$. Modifying the first (kinetic) term is generally referred to as "modified inertia" and modifying
the last term as "modified gravity".
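
The two variations can be written out explicitly. The derivation below is a sketch that assumes the standard three-term Newtonian action (kinetic, interaction, and gravitational field terms) and variations that vanish on the boundary. The labels $\delta_{x}$ and $\delta_{\Phi}$ are notation introduced here for illustration.

```latex
% Assumed Newtonian action (kinetic + interaction + field terms)
S_N = \int \frac{\rho v^2}{2}\, d^3x\, dt
    - \int \rho\, \Phi_N\, d^3x\, dt
    - \int \frac{|\nabla \Phi_N|^2}{8 \pi G}\, d^3x\, dt .

% For \rho = \sum_i m_i \delta(\mathbf{x} - \mathbf{x}_i), varying the
% particle trajectories and integrating by parts in time gives
\delta_{x} S_N
  = \sum_i m_i \int \left[ \dot{\mathbf{x}}_i \cdot \delta\dot{\mathbf{x}}_i
      - \nabla \Phi_N \cdot \delta \mathbf{x}_i \right] dt
  = - \sum_i m_i \int \left[ \ddot{\mathbf{x}}_i + \nabla \Phi_N \right]
      \cdot \delta \mathbf{x}_i \, dt = 0
  \;\Longrightarrow\;
  \frac{d^2 \mathbf{x}}{dt^2} = - \nabla \Phi_N .

% Varying the potential and integrating by parts in space gives
\delta_{\Phi} S_N
  = - \int \left[ \rho\, \delta\Phi_N
      + \frac{\nabla \Phi_N \cdot \nabla \delta\Phi_N}{4 \pi G} \right] d^3x\, dt
  = - \int \left[ \rho - \frac{\nabla^2 \Phi_N}{4 \pi G} \right]
      \delta\Phi_N \, d^3x\, dt = 0
  \;\Longrightarrow\;
  \nabla^2 \Phi_N = 4 \pi G \rho .
```

The kinetic term enters only the first variation and the field term only the second, which is why changing one or the other leads to the two distinct classes of modification.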

6.1.1 Modified inertia 

The first possibility, modified inertia, has been investigated by Milgrom [301, 322], who constructed
modified kinetic actions (the first term $S_{\rm kin}$ in Eq. 13) that are functionals depending on the
trajectory of the particle as well as on the acceleration constant $a_0$. By construction, the gravitational
potential is then still determined from the Newtonian Poisson equation, but the particle
equation of motion becomes, instead of Newton's second law:

$$ m\,\mathbf{A}[\{\mathbf{x}(t)\}, a_0] = -m\,\nabla\Phi_N, \tag{14} $$

where $\mathbf{A}$ is a functional of the whole trajectory $\{\mathbf{x}(t)\}$, with the dimensions of acceleration. The
Newtonian and MOND limits correspond to $[a_0 \to 0,\ \mathbf{A} \to d^2\mathbf{x}/dt^2]$ and $[a_0 \to \infty,\ \mathbf{A}[\{\mathbf{x}(t)\}, a_0] \to
a_0^{-1}\,\mathbf{Q}(\{\mathbf{x}(t)\})]$ where $\mathbf{Q}$ has dimensions of acceleration squared.

Milgrom [301] investigated theories of this vein and rigorously showed that they always had to
be time-nonlocal (see also Sect. 7.10) to be Galilean invariant. Interestingly, he also showed that
quantities such as energy and momentum had to be redefined but then enjoyed conservation
laws: this even leads to a generalized virial relation for bound trajectories, and in turn to an
important and robust prediction for circular orbits in an axisymmetric potential, shared by all
such theories. Eq. 14 becomes for such trajectories:

$$ \mu\!\left(\frac{V_c^2}{R\,a_0}\right) \frac{V_c^2}{R} = \frac{\partial \Phi_N}{\partial R}, \tag{15} $$

where $V_c$ and $R$ are the orbital speed and radius, and $\mu(x)$ is universal for each theory, and is
derived from the expression of the action specialized to circular trajectories. Thus, for circular 
trajectories, these theories recover exactly the heuristic Milgrom's law. Interestingly, it is this law 
which is used to fit galaxy rotation curves, while in the modified gravity framework of MOND 
(see hereafter), one should actually calculate the exact predictions of the modified Poisson formu- 
lations which can differ a little bit from Milgrom's law. However, for orbits other than circular, it 
becomes very difficult to make predictions in modified inertia, as the time non-locality can make 
the anomalous acceleration at any location depend on properties of the whole orbit. For instance, 
if the accelerations are small on some segments of a trajectory, MOND effects can be felt also on 

The Newtonian mass density also satisfies the continuity equation $\partial\rho/\partial t + \nabla\cdot(\rho\mathbf{v}) = 0$.

In General Relativity, the first two terms $\int \rho(v^2/2 - \Phi_N)\,d^3x\,dt$ are lumped together into the matter action
(also containing the rest mass contribution in GR), and the last term is generalized by the Einstein-Hilbert action.

Let us note in passing that it would not be the first time that the kinetic action is modified, as special
relativity does just this too, changing for a single particle $mv^2/2 \to -mc^2\gamma^{-1}$ (where $\gamma(v) = 1/\sqrt{1 - (v/c)^2}$),
leading for a moving body to a redefinition of the effective mass as $m_{\rm eff} = m\gamma(v)$. With this analogy in mind, a
rather simplified view of the Lorentz-breaking modification of inertia needed in order to reproduce MOND would
be that $m_{\rm eff} \sim m\mu(a)$, where $a$ is the amplitude of the acceleration with respect to an absolute preferred inertial
frame.

Such non-local theories, which also have to be nonlinear (like any MOND theory) are not easy to construct, and 
there is presently no real fully-fledged theory which has been developed in this vein, although hints in this direction 
are summarized in Sect. 7.10. 
segments where the accelerations are high, and conversely [322]. This can thus give rise to different
effects on bound and unbound orbits, as well as on circular and highly elliptic orbits, meaning that
"predictions" of modified inertia in pressure-supported systems could differ significantly from those
derived from Milgrom's law per se. Let us finally note that testing modified inertia on Earth would
need to properly define an inertial reference frame, contrary to what has been done in [5l I180j 
where the laboratory itself was not an inertial frame. Proper set-ups for testing modified inertia 
on Earth have been described, e.g., in [202, 203]: under the circumstances described in these papers,
modified inertia would inevitably predict a departure from Newtonian dynamics, even if the
exact departure cannot be predicted at present, except for circular motion. 
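
Since circular orbits are the one case where modified inertia reduces exactly to Milgrom's law, a rotation curve can be computed directly from the Newtonian acceleration. The following is a minimal sketch for a point mass, assuming the "simple" interpolating function $\mu(x) = x/(1+x)$ of Sect. 6.2 and a value $a_0 \approx 1.2 \times 10^{-10}$ m s$^{-2}$ (about 3700 (km/s)$^2$ kpc$^{-1}$); the function names and the unit system are illustrative choices, not taken from the cited works.

```python
import math

G = 4.30091e-6   # gravitational constant in kpc (km/s)^2 / Msun
A0 = 3.7e3       # assumed a0 ~ 1.2e-10 m/s^2, expressed in (km/s)^2 / kpc

def mu_simple(x):
    """'Simple' interpolating function mu(x) = x / (1 + x) (an assumed choice)."""
    return x / (1.0 + x)

def mond_acceleration(g_newton, a0=A0):
    """Solve mu(g/a0) * g = g_N for g; with the simple mu this is a quadratic in g."""
    return 0.5 * (g_newton + math.sqrt(g_newton ** 2 + 4.0 * g_newton * a0))

def circular_speed(mass, radius, a0=A0):
    """Circular speed (km/s) at radius (kpc) around a point mass (Msun)."""
    g_newton = G * mass / radius ** 2
    return math.sqrt(mond_acceleration(g_newton, a0) * radius)

if __name__ == "__main__":
    for r in (1.0, 10.0, 100.0, 1000.0):
        print(r, circular_speed(1e10, r))
```

At large radii the printed speeds flatten towards $(GMa_0)^{1/4}$, whereas the Newtonian curve would keep falling as $R^{-1/2}$.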



6.1.2 Bekenstein-Milgrom MOND

The idea of modified gravity is to preserve the particle equation of motion by preserving the kinetic 
action, but to change the gravitational action, and thus modify the Poisson equation. In that case, 
all the usual conservation laws will be preserved by construction. 
A very general way to do so is to write [39]:

$$S_{\rm grav\,BM} \equiv -\int \frac{a_0^2\, F(|\nabla\Phi|^2/a_0^2)}{8\pi G}\, d^3x\, dt, \tag{16}$$

where $F$ can be any dimensionless function. The Lagrangian being non-quadratic in $|\nabla\Phi|$, this
has been dubbed by Bekenstein & Milgrom [39] Aquadratic Lagrangian theory (AQUAL). Varying
the action with respect to $\Phi$ then leads to a non-linear generalization of the Newtonian Poisson
equation:

$$\nabla\cdot\left[\mu\left(\frac{|\nabla\Phi|}{a_0}\right)\nabla\Phi\right] = 4\pi G\rho, \tag{17}$$

where $\mu(x) = F'(z)$ and $z = x^2$. In order to recover the $\mu$-function behavior of Milgrom's law
(Eq. 7), i.e., $\mu(x) \to 1$ for $x \gg 1$ and $\mu(x) \to x$ for $x \ll 1$, one needs to choose:

$$F(z) \to z \;\; {\rm for} \;\; z \gg 1 \quad {\rm and} \quad F(z) \to \frac{2}{3}z^{3/2} \;\; {\rm for} \;\; z \ll 1. \tag{18}$$

The general solution of the boundary value problem for Eq. 17 leads to the following relation
between the acceleration $\mathbf{g} = -\nabla\Phi$ and the Newtonian one, $\mathbf{g}_N = -\nabla\Phi_N$:

$$\mu\left(\frac{g}{a_0}\right)\mathbf{g} = \mathbf{g}_N + \mathbf{S}, \tag{19}$$

where $g = |\mathbf{g}|$, and $\mathbf{S}$ is a solenoidal vector field with no net flow across any closed surface (i.e., a
curl field $\mathbf{S} = \nabla\times\mathbf{A}$ such that $\nabla\cdot\mathbf{S} = 0$). It is thus equivalent to Milgrom's law (Eq. 7) up to a
curl field correction, and is precisely equal to Milgrom's law in highly symmetric one-dimensional
systems, such as spherically symmetric systems or flattened systems for which the isopotentials
are locally spherically symmetric. For instance, the Kuzmin disk [53] is an example of a flattened
axisymmetric configuration for which Milgrom's law is precisely valid, as its Newtonian potential
$\Phi_N = -GM/\sqrt{R^2 + (b + |z|)^2}$ is equivalent on both sides of the disk to that of a point mass above
or below the disk respectively.

In vacuum and at very large distances from a body of mass $M$, the isopotentials always tend
to become spherical and the curl field tends to zero, while the gravitational acceleration falls well
below $a_0$ (a regime known as the "deep-MOND" regime), so that:

$$\Phi(r) \sim \sqrt{GMa_0}\,\ln(r). \tag{20}$$
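
As a short worked check (a sketch assuming exact spherical symmetry, so that the curl field vanishes), the logarithmic potential of Eq. 20 follows from Milgrom's law once $\mu(x) \to x$:

```latex
\mu\!\left(\frac{g}{a_0}\right) g = g_N
\;\xrightarrow{\; g \ll a_0 \;}\;
\frac{g^2}{a_0} = \frac{GM}{r^2}
\;\Longrightarrow\;
g = \frac{\sqrt{GMa_0}}{r},
\qquad
\Phi(r) = \int g \, dr = \sqrt{GMa_0}\,\ln r + \mathrm{const},
\qquad
V_c^4 = (g r)^2 = GMa_0 .
```

The last equality shows that the asymptotic circular speed depends only on the enclosed mass and on $a_0$, not on the radius.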



Following the dielectric analogy (Sect. 5.1), this is akin to Maxwell's first equation, Gauss' law, in terms of free
charge density $\rho_f$, i.e., $\nabla\cdot[\mu\epsilon_0\mathbf{E}] = \rho_f$, where $\mathbf{E}$ is the electric field and $\mu\epsilon_0\mathbf{E} = \mathbf{D}$ is the electric displacement field.
See [57] for a thorough discussion of the analogy.
An important point, demonstrated by Bekenstein & Milgrom [39], is that a system with a low
center-of-mass acceleration with respect to a larger (more massive) system sees the motion of its
constituents combine to give a MOND motion for the center-of-mass, even if it is made of constituents
whose internal accelerations are above $a_0$ (for instance a compact globular cluster moving
in the outer Galaxy). The center-of-mass acceleration is independent of the internal structure of
the system (if the mass of the system is small), namely the Weak Equivalence Principle is satisfied.

In a modified gravity theory, any time-independent system must still satisfy the virial theorem:

$$2K + W = 0, \tag{21}$$

where $K = M\langle v^2\rangle/2$ is the total kinetic energy of the system, $M = \sum_i m_i$ being the total mass
of the system, $\langle v^2\rangle$ the second moment of the velocity distribution, and $W = -\int \rho\,\mathbf{x}\cdot\nabla\Phi\, d^3x$
is the "virial", proportional to the total potential energy. Milgrom [302, 303] showed that, in
Bekenstein-Milgrom MOND, the virial is given by:

$$W = -\frac{2}{3}\sqrt{GM^3a_0} - \frac{1}{4\pi G}\int\left[\frac{3}{2}a_0^2 F(|\nabla\Phi|^2/a_0^2) - \mu(|\nabla\Phi|/a_0)|\nabla\Phi|^2\right] d^3x. \tag{22}$$

For a system entirely in the extremely weak field limit (the "deep-MOND" limit $x = g/a_0 \ll 1$)
where $\mu(x) = x$ and $F(z) = (2/3)z^{3/2}$, the second term vanishes and we thus get $W = (-2/3)\sqrt{GM^3a_0}$
(see [302] for the specific conditions for this to be valid). In this case, we can get
an analytic expression for the two-body force under the approximation that the two bodies are very
far apart compared to their internal sizes [302, 510, 512]. Since the kinetic energy $K = K_{\rm orb} + K_{\rm int}$
can be separated into the orbital energy $K_{\rm orb} = m_1m_2v_{\rm rel}^2/(2M)$ and the internal energy of the
bodies $K_{\rm int} = \sum_i (1/3)\sqrt{Gm_i^3a_0}$, we get from the scalar virial theorem of a stationary system:

$$\frac{m_1m_2}{M}v_{\rm rel}^2 = \frac{2}{3}\sqrt{Ga_0}\left[(m_1 + m_2)^{3/2} - m_1^{3/2} - m_2^{3/2}\right]. \tag{23}$$

We can then assume an approximately circular velocity such that the two-body force (satisfying
the action and reaction principle) can be written analytically in the deep-MOND limit as:

$$F_{\rm 2body} = \frac{m_1m_2}{m_1 + m_2}\frac{v_{\rm rel}^2}{r} = \frac{2}{3}\left[(m_1 + m_2)^{3/2} - m_1^{3/2} - m_2^{3/2}\right]\frac{\sqrt{Ga_0}}{r}. \tag{24}$$
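
To make the content of Eq. 23 and Eq. 24 concrete, the sketch below evaluates the deep-MOND two-body force and the corresponding relative speed. It assumes both bodies are deep in the MOND regime and well separated, as required above; the function names and the choice of units (left to the caller) are illustrative.

```python
import math

def _mass_term(m1, m2):
    """(m1 + m2)^(3/2) - m1^(3/2) - m2^(3/2), the mass factor of Eq. 23 and Eq. 24."""
    return (m1 + m2) ** 1.5 - m1 ** 1.5 - m2 ** 1.5

def two_body_force_deep_mond(m1, m2, r, G, a0):
    """Deep-MOND two-body force; only meaningful for two well-separated bodies."""
    return (2.0 / 3.0) * _mass_term(m1, m2) * math.sqrt(G * a0) / r

def relative_speed_squared(m1, m2, G, a0):
    """v_rel^2 from the virial relation; note that it does not depend on separation."""
    return (2.0 / 3.0) * math.sqrt(G * a0) * _mass_term(m1, m2) * (m1 + m2) / (m1 * m2)
```

In the test-particle limit $m_2 \ll m_1$ the force reduces to $m_2\sqrt{Gm_1a_0}/r$, i.e., the deep-MOND acceleration of Eq. 20 acting on the light body.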



The latter equation is not valid for N-body configurations, for which the Bekenstein-Milgrom
(BM) modified Poisson equation (Eq. 17) must be solved numerically (apart from highly symmetric
N-body configurations). This equation is a non-linear elliptic partial differential equation. It can
be solved numerically using various methods [STJ [751 HZl fH5] I^STlHSg] . One of them [TS] Ii55] is 
to use a multigrid algorithm to solve the discrete form of Eq. 17 (see also Figure 17):

$$\begin{aligned}
4\pi G\rho_{i,j,k} = \big[ & (\Phi_{i+1,j,k} - \Phi_{i,j,k})\mu_{M_1} - (\Phi_{i,j,k} - \Phi_{i-1,j,k})\mu_{L_1} \\
 + & (\Phi_{i,j+1,k} - \Phi_{i,j,k})\mu_{M_2} - (\Phi_{i,j,k} - \Phi_{i,j-1,k})\mu_{L_2} \\
 + & (\Phi_{i,j,k+1} - \Phi_{i,j,k})\mu_{M_3} - (\Phi_{i,j,k} - \Phi_{i,j,k-1})\mu_{L_3} \big] / h^2,
\end{aligned} \tag{25}$$

where

• $\rho_{i,j,k}$ is the density discretized on a grid of step $h$,

• $\Phi_{i,j,k}$ is the MOND potential discretized on the same grid of step $h$,

• $\mu_{M_i}$ and $\mu_{L_i}$ are the values of $\mu(x)$ at points $M_i$ and $L_i$ corresponding to $(i + 1/2, j, k)$ and
$(i - 1/2, j, k)$ respectively (Figure 17).

The gradient components $(\partial/\partial x, \partial/\partial y, \partial/\partial z)$ in $\mu(x)$ are approximated in the case of $\mu_{M_1}$ by
$([\Phi(B) - \Phi(A)]/h, [\Phi(I) + \Phi(H) - \Phi(K) - \Phi(J)]/(4h), [\Phi(C) + \Phi(D) - \Phi(E) - \Phi(F)]/(4h))$ (see
Figure 17).

In [458], the Gauss-Seidel relaxation with red and black ordering is used to solve this discretized
equation, with the boundary condition for the Dirichlet problem given by Eq. 20 at large radii. It is
obvious that subsequently devising an evolving N-body code for this theory can only be done using 
particle-mesh techniques rather than the gridless multipole expansion treecode schemes widely 
used in standard gravity. 

Finally, let us note that it could be imagined that MOND, given some of its observational
problems (developed in Sect. 6.6), is incomplete and needs a new scale in addition to $a_0$. There are
several ways to implement such an idea, but for instance, Bekenstein [37] proposed in this vein a
generalization of the AQUAL formalism by adding a velocity scale $s_0$, in order to allow for effective
variations of the acceleration constant as a function of the deepness of the potential, namely:

$$S_{\rm grav\,Bek} \equiv -\int \frac{a_0^2\, e^{-2\Phi/s_0^2}}{8\pi G}\, F\left(\frac{|\nabla\Phi|^2 e^{2\Phi/s_0^2}}{a_0^2}\right) d^3x\, dt, \tag{26}$$



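leading to

$$\nabla\cdot\left[\mu\left(\frac{|\nabla\Phi|}{a_{0\rm eff}}\right)\nabla\Phi\right] - \frac{|\nabla\Phi|^2}{s_0^2}\,\mu\left(\frac{|\nabla\Phi|}{a_{0\rm eff}}\right) + \frac{a_{0\rm eff}^2}{s_0^2}\,F\left(\frac{|\nabla\Phi|^2}{a_{0\rm eff}^2}\right) = 4\pi G\rho, \tag{27}$$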

where $a_{0\rm eff} = a_0 e^{-\Phi/s_0^2}$. Interestingly, with this "modified MOND", Gauss' theorem (or Newton's
second theorem) would no longer be valid in spherical symmetry. A suitable choice of $s_0$ (e.g.,
of the order of 10^ km/s, see [37]) could affect the dynamics of galaxy clusters (by boosting the 
modification with an effectively higher value of $a_0$) compared to the previous MOND equation,
while keeping the less massive systems such as galaxies typically unaffected compared to usual
MOND, while other (lower) values of $s_0$ could allow (modulo a renormalization of $a_0$) for a stronger
modification in galaxy clusters as well as milder modification in subgalactic systems such as globular 
clusters, which, as we shall see hereafter could be interesting from a phenomenological point of 
view (see Sect. 6.6). However, the possibility of too strong a modification should be carefully 
investigated, as well as, in a relativistic (see Sect. 7) version of the theory, the consequences on the 
dynamics of a scalar-field with a similar action. 



6.1.3 QUMOND 

Another way [319] of modifying gravity in order to reproduce Milgrom's law is to still keep the
"matter action" unchanged, $S_{\rm kin} + S_{\rm in} = \int \rho(v^2/2 - \Phi)\,d^3x\,dt$, thus ensuring that varying the
action of a test particle with respect to the particle degrees of freedom leads to $d^2\mathbf{x}/dt^2 = -\nabla\Phi$,
but to invoke an auxiliary acceleration field $\mathbf{g}_N = -\nabla\Phi_N$ in the gravitational action instead of
invoking an aquadratic Lagrangian in $|\nabla\Phi|$. The addition of such an auxiliary field can of course
be done without modifying Newtonian gravity, by writing the Newtonian gravitational action in
the following way:

$$S_{\rm grav\,N} = \frac{1}{8\pi G}\int \left(2\nabla\Phi\cdot\mathbf{g}_N + g_N^2\right) d^3x\, dt. \tag{28}$$

It gives, after variation over $\mathbf{g}_N$ (or over $\Phi_N$): $\mathbf{g}_N = -\nabla\Phi$. And after variation of the full action
over $\Phi$: $-\nabla\cdot\mathbf{g}_N = 4\pi G\rho$, i.e., Newtonian gravity. One can then introduce a MONDian modification

This is similar to the Palatini formalism of GR, where the present auxiliary acceleration field is replaced by a
connection.

Figure 17: Discretisation scheme of the BM modified Poisson equation (Eq. 17) and of the phantom
dark matter derivation in QUMOND. The node $(i, j, k)$ corresponds to A on the upper panel. The
gradient components in $\mu(x)$ (for Eq. 25) and $\nu(y)$ (for Eq. 35) are estimated at the $L_i$ and $M_i$
points. (Figure courtesy of O. Tiret)
of gravity by modifying this action in the following way, replacing $g_N^2$ by a non-linear function of
it and assuming that it derives from an auxiliary potential, $\mathbf{g}_N = -\nabla\Phi_N$, so that the new degree
of freedom is this new potential:

$$S_{\rm grav\,QUMOND} = -\frac{1}{8\pi G}\int \left[2\nabla\Phi\cdot\nabla\Phi_N - a_0^2\, Q(|\nabla\Phi_N|^2/a_0^2)\right] d^3x\, dt. \tag{29}$$

Varying the total action with respect to $\Phi$ yields: $\nabla^2\Phi_N = 4\pi G\rho$. And varying it with respect to
the auxiliary (Newtonian) potential $\Phi_N$ yields:

$$\nabla^2\Phi = \nabla\cdot\left[\nu\left(\frac{|\nabla\Phi_N|}{a_0}\right)\nabla\Phi_N\right], \tag{30}$$

where $\nu(y) = Q'(z)$ and $z = y^2$. The theory thus only requires solving the Newtonian linear
Poisson equation twice, with only one non-linear step in calculating the rhs term of Eq. 30. For this
reason, it is called the quasi-linear formulation of MOND (QUMOND). In order to recover the
$\nu$-function behavior of Milgrom's law (Eq. 10), i.e., $\nu(y) \to 1$ for $y \gg 1$ and $\nu(y) \to y^{-1/2}$ for
$y \ll 1$, one needs to choose:

$$Q(z) \to z \;\; {\rm for} \;\; z \gg 1 \quad {\rm and} \quad Q(z) \to \frac{4}{3}z^{3/4} \;\; {\rm for} \;\; z \ll 1. \tag{31}$$

The general solution of the system of partial differential equations is equivalent to Milgrom's law
(Eq. 10) up to a curl field correction, and is precisely equal to Milgrom's law in highly symmetric
one-dimensional systems. However, this curl-field correction is different from the one of AQUAL.
This means that, outside of high symmetry, AQUAL and QUMOND cannot be precisely equivalent.
An illustration of this is given in [510]: for a system with all its mass in an elliptical shell (in the
sense of a squashed homogeneous spherical shell), the effective density of matter that would source
the MOND force field in Newtonian gravity is uniformly zero in the void inside the shell for
QUMOND, but nonzero for AQUAL.

The concept of the effective density of matter that would source the MOND force field in 
Newtonian gravity is extremely useful for an intuitive comprehension of the MOND effect, and/or 
for interpreting MOND in the dark matter language: indeed, subtracting from this effective density 
the baryonic density yields what is called the "phantom dark matter" distribution. In AQUAL, it 
requires deriving the Newtonian Poisson equation after having solved for the MOND one. On the 
other hand, in QUMOND, knowing the Newtonian potential yields direct access to the phantom 
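dark matter distribution even before knowing the MOND potential. After choosing a $\nu$-function,
one defines

$$\tilde\nu(y) = \nu(y) - 1, \tag{32}$$

and one has, for the phantom dark matter density,

$$\rho_{\rm ph} = \frac{\nabla\cdot\left[\tilde\nu(|\nabla\Phi_N|/a_0)\,\nabla\Phi_N\right]}{4\pi G}. \tag{33}$$

This $\tilde\nu$-function appears naturally in an alternative formulation of QUMOND where one writes the
action as a function of an auxiliary potential $\Phi_{\rm ph}$:

$$S_{\rm grav\,QUMOND} = -\frac{1}{8\pi G}\int \left[|\nabla\Phi|^2 - |\nabla\Phi_{\rm ph}|^2 - a_0^2\, H(|\nabla\Phi - \nabla\Phi_{\rm ph}|^2/a_0^2)\right] d^3x\, dt, \tag{34}$$

leading to a potential $\Phi_{\rm ph}$ obeying a QUMOND equation with $\tilde\nu(y) = H'(y^2)$, and $\Phi = \Phi_N + \Phi_{\rm ph}$.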
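
In spherical symmetry the phantom dark matter can be obtained in closed form, which gives a quick feel for the concept. Applying Gauss' theorem to $\rho_{\rm ph} = \nabla\cdot[\tilde\nu\,\nabla\Phi_N]/(4\pi G)$ around a point mass $M$ gives an enclosed phantom mass $M_{\rm ph}(<r) = \tilde\nu(y)M$ with $y = GM/(r^2a_0)$. The sketch below evaluates this, assuming the "simple" $\nu$-function of Sect. 6.2; the function names are illustrative.

```python
import math

def nu_simple(y):
    """'Simple' nu-function, nu(y) = [1 + sqrt(1 + 4/y)] / 2 (an assumed choice)."""
    return 0.5 * (1.0 + math.sqrt(1.0 + 4.0 / y))

def phantom_mass_enclosed(mass, r, G, a0, nu=nu_simple):
    """Phantom dark matter mass inside radius r around a point mass.

    Gauss' theorem applied to rho_ph = div[(nu - 1) grad Phi_N] / (4 pi G)
    gives M_ph(<r) = (nu(y) - 1) * M, with y = G M / (r^2 a0).
    """
    y = G * mass / (r * r * a0)
    return (nu(y) - 1.0) * mass
```

Far from the mass ($y \ll 1$) the enclosed phantom mass grows linearly with radius, $M_{\rm ph} \approx r\sqrt{Ma_0/G}$, which is the behavior of an isothermal dark halo producing a flat rotation curve.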
Numerically, for a given Newtonian potential discretized on a grid of step $h$, the discretized
phantom dark matter density is given on grid points $(i, j, k)$ by (see Figure 17 and cf. Eq. 25, see
also [IT]): 



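$$\begin{aligned}
\rho_{\rm ph}^{(i,j,k)} = \big[ & (\Phi_N^{(i+1,j,k)} - \Phi_N^{(i,j,k)})\tilde\nu_{M_1} - (\Phi_N^{(i,j,k)} - \Phi_N^{(i-1,j,k)})\tilde\nu_{L_1} \\
 + & (\Phi_N^{(i,j+1,k)} - \Phi_N^{(i,j,k)})\tilde\nu_{M_2} - (\Phi_N^{(i,j,k)} - \Phi_N^{(i,j-1,k)})\tilde\nu_{L_2} \\
 + & (\Phi_N^{(i,j,k+1)} - \Phi_N^{(i,j,k)})\tilde\nu_{M_3} - (\Phi_N^{(i,j,k)} - \Phi_N^{(i,j,k-1)})\tilde\nu_{L_3} \big] / (4\pi G h^2).
\end{aligned} \tag{35}$$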

This means that any N-body technique (e.g., treecodes or fast multipole methods) can be adapted 
to QUMOND (a grid being however necessary as an intermediate step). Once the Newtonian 
potential (or force) is locally known, the phantom dark matter density can be computed and 
then represented by weighted particles, whose gravitational attraction can then be computed in 
any traditional manner. An example is given in Figure 18, where one considers a rather typical
baryonic galaxy model with a small bulge and a large disk. Applying Eq. 1351 (with the z/-function 
of Eq. 133]) then yields the phantom density |254| . Interestingly, this phantom density is composed 
of a round "dark halo" and a flattish "dark disk" (see [306] for an extensive discussion of how such
a dark disk component comes about, see also [5T] and Sect. 6.5.2 for observational considerations). 
Let us note that this phantom dark matter density can be slightly separated from the baryonic
density distribution in non-spherical situations [227], and that it can be negative [298, 491], contrary
to normal dark matter. Finding the signature of such a local negative dark matter density could
be a way of exhibiting a clear signature of MOND. 
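
The grid recipe described above (Eq. 35) can be sketched in a few lines. The following pure-Python version is only an illustration under simplifying assumptions: the Newtonian potential is supplied as a nested list on a cubic grid, only interior nodes are evaluated, and the $\tilde\nu$-function is passed in by the caller. The names `phantom_density` and `nu_at_midpoint` are hypothetical, and a production code would use array operations instead of explicit loops.

```python
import math

AXES = ((1, 0, 0), (0, 1, 0), (0, 0, 1))

def _shift(p, e, sign=1):
    """Return grid index p displaced by sign * e."""
    return (p[0] + sign * e[0], p[1] + sign * e[1], p[2] + sign * e[2])

def phantom_density(phi_n, h, a0, G, nu_tilde):
    """Phantom density on the interior nodes of a cubic grid phi_n[i][j][k]."""
    n = len(phi_n)

    def phi(p):
        return phi_n[p[0]][p[1]][p[2]]

    def nu_at_midpoint(p, e):
        # |grad Phi_N| at the midpoint of the link from p to q = p + e:
        # one-sided difference along the link, 4-point average across it.
        q = _shift(p, e)
        grad_sq = ((phi(q) - phi(p)) / h) ** 2
        for b in AXES:
            if b != e:
                d = (phi(_shift(p, b)) + phi(_shift(q, b))
                     - phi(_shift(p, b, -1)) - phi(_shift(q, b, -1))) / (4.0 * h)
                grad_sq += d * d
        return nu_tilde(math.sqrt(grad_sq) / a0)

    rho = {}
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            for k in range(1, n - 1):
                p = (i, j, k)
                total = 0.0
                for e in AXES:
                    fwd, bwd = _shift(p, e), _shift(p, e, -1)
                    total += (phi(fwd) - phi(p)) * nu_at_midpoint(p, e)    # M point
                    total -= (phi(p) - phi(bwd)) * nu_at_midpoint(bwd, e)  # L point
                rho[p] = total / (4.0 * math.pi * G * h * h)
    return rho
```

The returned dictionary maps interior grid indices to densities; these can then be turned into weighted particles and fed to any Newtonian force solver, as described in the text.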



[Figure 18, panels (a) and (b); colour scale: phantom dark matter density]
Figure 18: (a) Baryonic density of a model galaxy made of a small Plummer bulge with a mass
of 2 X 10^ Mq and Plummer radius of 185 pc, and of a Miyamoto-Nagai disk of 1.1 x 10^° Mq, a 
scale-length of 750 pc and a scale-height of 300 pc. (b) The derived phantom dark matter density 
distribution: it is composed of a spheroidal component similar to a dark matter halo, and of a thin 
disky component (Figure made by Fabian Lüghausen [254])



Finally, let us note that, as shown in [319, 510], (i) a system made of high-acceleration constituents,
but with a low-acceleration center-of-mass, moves according to a low-acceleration MOND
law, while (ii) the virial of a system is given by

$$W = -\frac{2}{3}\sqrt{GM^3a_0} - \frac{1}{4\pi G}\int\left[-\frac{3}{2}a_0^2\, Q(|\nabla\Phi_N|^2/a_0^2) + 2\nu(|\nabla\Phi_N|/a_0)|\nabla\Phi_N|^2\right] d^3x, \tag{36}$$

meaning that for a system entirely in the extremely weak field limit where $\nu(y) = y^{-1/2}$ and
$Q(z) = (4/3)z^{3/4}$, the second term vanishes and we thus get $W = (-2/3)\sqrt{GM^3a_0}$, precisely like
in Bekenstein-Milgrom MOND. This means that, although the curl-field correction is in general
different in AQUAL and QUMOND, the two-body force in the deep-MOND limit is the same [510].



6.2 The interpolating function 

The basis of the MOND paradigm is to reproduce Milgrom's law, Eq. [71 in highly symmetrical 
systems, with an interpolating function asymptotically obeying the conditions of Eq.[51 i.e., — )> 
1 for $x \gg 1$ and $\mu(x) \to x$ for $x \ll 1$. Obviously, in order for the relation between $g$ and $g_N$ to

be uniquely determined, another constraint is that $x\mu(x)$ must be a monotonically increasing
function of $x$, or equivalently

$$\mu(x) + x\mu'(x) > 0, \tag{37}$$

or equivalently

$$\frac{d\ln\mu}{d\ln x} > -1. \tag{38}$$

Even though this leaves some freedom for the exact shape of the interpolating function, leading 
to the various families of functions hereafter, let us insist that it is already extremely surprising, 
from the dark matter point of view, that the MOND prescriptions for the asymptotic behavior of 
the interpolating function did predict all the aspects of the dynamics of galaxies listed in Sect. 5. 

As we have seen in Sect. 6.1, an alternative formulation of the MOND paradigm relies on Eq. 10,
based on an interpolating function

$$\nu(y) = 1/\mu(x) \quad {\rm where} \quad y = x\mu(x). \tag{39}$$

In that case, we also have that $y\nu(y)$ must be a monotonically increasing function of $y$.

Finally, as we shall see in detail in Sect. 7, many MOND relativistic theories boil down to
multifield theories where the weak-field limit can be represented by a potential $\Phi = \sum_i \phi_i$, where
each $\phi_i$ obeys a generalized Poisson equation, the most common case being

$$\Phi = \Phi_N + \phi, \tag{40}$$

where $\Phi_N$ obeys the Newtonian Poisson equation and the scalar field $\phi$ (with dimensions of a
potential) plays the role of the phantom dark matter potential and obeys an equation of either
the type of Eq. 17 or of Eq. 30. When it obeys a QUMOND type of equation (Eq. 30), the
$\nu$-function must be replaced by the $\tilde\nu$-function of Eq. 32. When it obeys a BM-like equation (Eq. 17),
the classical interpolating function $\mu(x)$ acting on $x = |\nabla\Phi|/a_0$ must be replaced by another
interpolating function $\tilde\mu(s)$ acting on $s = |\nabla\phi|/a_0$, in order for the total potential $\Phi$ to conform
to Milgrom's law. In the absence of a renormalization of the gravitational constant, the two
functions are related through [146]

$$\tilde\mu(s) = (x - s)s^{-1} \quad {\rm where} \quad s = x[1 - \mu(x)]. \tag{41}$$

For $x \ll 1$ (the deep-MOND regime), one has $s = x(1 - x) \ll 1$ and $x \sim s(1 + s)$, yielding $\tilde\mu(s) \sim s$,
i.e., although it is generally different, $\tilde\mu$ has the same low-gravity asymptotic behavior as $\mu$.

In spherical symmetry, all these different formulations can be made equivalent by choosing
equivalent interpolating functions, but the theories will typically slightly differ outside of spherical
symmetry (i.e., the curl field will be slightly different). As an example, let us consider a widely
used interpolating function [142, 167, 400, 509] yielding excellent fits in the intermediate to weak
gravity regime of galaxies (but not in the strong gravity regime of the Solar system), known as the
"simple" $\mu$-function (see Figure 19):

$$\mu(x) = \frac{x}{1 + x}. \tag{42}$$



Confusing these two interpolating functions $\mu(x)$ and $\tilde\mu(s)$ can lead to serious mistakes [490], as illustrated by [42].
This yields $y = x^2/(1 + x)$, and thus $x = [y + (y^2 + 4y)^{1/2}]/2$, and $\nu = (1 + x)/x$ yields the "simple"
$\nu$-function:

$$\nu(y) = \frac{1 + (1 + 4y^{-1})^{1/2}}{2}. \tag{43}$$

It also yields $s = x[1 - \mu(x)] = x/(1 + x) = \mu$, and hence $x = s/(1 - s)$, yielding for the "simple"
$\tilde\mu$-function:

$$\tilde\mu(s) = \frac{s}{1 - s}. \tag{44}$$
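
The three forms of the "simple" function can be checked against each other numerically. The sketch below evaluates the defining relations $\nu(y) = 1/\mu(x)$ with $y = x\mu(x)$ (Eq. 39) and $\tilde\mu(s) = (x - s)/s$ with $s = x[1 - \mu(x)]$ (Eq. 41); the helper name `check_simple_family` is illustrative.

```python
import math

def mu_simple(x):
    """'Simple' mu-function of Eq. 42."""
    return x / (1.0 + x)

def nu_simple(y):
    """'Simple' nu-function of Eq. 43."""
    return 0.5 * (1.0 + math.sqrt(1.0 + 4.0 / y))

def mu_tilde_simple(s):
    """'Simple' mu-tilde function of Eq. 44, defined for 0 < s < 1."""
    return s / (1.0 - s)

def check_simple_family(x):
    """Return the residuals of Eq. 39 and Eq. 41 at a given x (both should vanish)."""
    y = x * mu_simple(x)
    s = x * (1.0 - mu_simple(x))
    residual_nu = nu_simple(y) * mu_simple(x) - 1.0
    residual_mu_tilde = mu_tilde_simple(s) - (x - s) / s
    return residual_nu, residual_mu_tilde
```

Both residuals vanish to rounding accuracy for any $x > 0$, which is a convenient guard against confusing $\mu(x)$ with $\tilde\mu(s)$ when implementing a multifield theory.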

A more general family of $\tilde\mu$-functions is known as the $\alpha$-family [TS], valid for $0 \leq \alpha \leq 1$ and
including the simple function as the $\alpha = 1$ case:

$$\tilde\mu_\alpha(s) = \frac{s}{1 - \alpha s}, \tag{45}$$

corresponding to the following family of $\mu$-functions:

$$\mu_\alpha(x) = \frac{2x}{1 + (2 - \alpha)x + [(1 - \alpha x)^2 + 4x]^{1/2}}. \tag{46}$$

The $\alpha = 0$ case is sometimes referred to as "Bekenstein's $\mu$-function" (see Figure 19) as it was used
in [34]. The problem here is that all these $\mu$-functions approach 1 quite slowly, with $\zeta \leq 1$ in their
asymptotic expansion for $x \to \infty$, $\mu(x) \sim 1 - Ax^{-\zeta}$. Indeed, since $s = x[1 - \mu(x)]$, its asymptotic
behavior is $s \sim Ax^{-\zeta+1}$. So, if $\zeta > 1$, $s \to 0$ for $x \to \infty$ as well as for $x \to 0$, which would imply
that $x(s) = s\tilde\mu(s) + s$ would be a multivalued function, and that the gravity would be ill-defined.
This is problematic because even for the extreme case $\zeta = 1$, the anomalous acceleration does
not go to zero in the strong gravity regime: there is still a constant anomalous "Pioneer-like"
acceleration $x[1 - \mu(x)] \to A$, which is observationally excluded from very accurate planetary
ephemerides [155]. What is more, these $\tilde\mu$-functions, defined only in the domain $0 < s < \alpha^{-1}$,
would need very carefully chosen boundary conditions to avoid covering values of $s$ outside of the
allowed domain when solving for the Poisson equation for the scalar field.

The way out to design $\tilde\mu$-functions corresponding to acceptable $\mu$-functions in the strong gravity
regime is to proceed to a renormalization of the gravitational constant [146]: this means that
the bare value of $G$ in the Poisson and generalized Poisson equations ruling the bare Newtonian
potential $\phi_N$ and the scalar field $\phi$ in Eq. 40 is different from the gravitational constant measured
on Earth, $G_N$ (related to the true Newtonian potential $\Phi_N$). One can assume that the bare
gravitational constant $G$ is related to the measured one through

$$G_N = \xi G, \tag{47}$$

meaning that $x = y + s$ where $x = \nabla\Phi/a_0$, $y = \nabla\phi_N/a_0 = \nabla\Phi_N/(\xi a_0)$, and $s\tilde\mu(s) = y$. We then
have for Milgrom's law:

$$x\mu(x) = \xi(x - s) = \xi s\tilde\mu(s). \tag{48}$$

In order to recover $\mu(x) \to 1$ for $x \to \infty$, it is straightforward to show [146] that it suffices that
$\tilde\mu(s) \to \tilde\mu_0$ for $s \to \infty$, and that $\xi = 1 + \tilde\mu_0^{-1}$. Then if $\zeta > 1$ in the asymptotic expansion
$\mu(x) \sim 1 - x^{-\zeta}$, one has $s \sim (1 + \tilde\mu_0^{-1})^{-1}x^{-\zeta+1} + (1 + \tilde\mu_0)^{-1}x$. This second linear term allows $s$ to


In principle, $\alpha$ can be slightly larger, but if $\alpha \gg 1$, then in the range of gravities of interest for galaxy dynamics
(between $0.1a_0$ and a few times $a_0$) the scalar field contribution $s$ is too small to account for the MOND effect, or
said in another way, the corresponding Milgrom $\mu$-function would deviate significantly from $\mu(x) = x$ (i.e., $\mu(x) > x$,
so that there would be less modification to the Newtonian prediction).

In principle, one could make $A = \alpha^{-1}$ as small as desired in the $\alpha$-family, by not limiting $\alpha$ to the range between
0 and 1, but passing solar system constraints would require $\alpha > 20$, which would cancel the MOND effect in the
range of interest for galaxy dynamics.
go to infinity for large $x$ and thus $x(s)$ to be single-valued. On the other hand, for the deep-MOND
regime, the renormalization of $G$ implies that $\tilde\mu(s) \to s/\xi$ for $s \ll 1$.

We can then use, even in multifield theories, $\mu$-functions quickly asymptoting to 1. For each
of these functions, there is a one-parameter family of corresponding $\tilde\mu$-functions (labelled by the
parameter $\tilde\mu(\infty) = \tilde\mu_0$), obtained by inserting $\mu(x)$ into $s = x[1 - \mu(x)/\xi]$ and making sure that
the function is increasing and thus invertible. A useful family of such $\mu$-functions asymptoting
more quickly towards 1 than the $\alpha$-family is the $n$-family:

$$\mu_n(x) = \frac{x}{(1 + x^n)^{1/n}}. \tag{49}$$

The case $n = 1$ is again the simple $\mu$-function, while the case $n = 2$ has been extensively used
in rotation curve analysis from the very first analyses [29, 224] to this day [399], and is thus
known as the "standard" $\mu$-function (see Figure 19). The corresponding $\tilde\mu$-function for $n \geq 2$ has
a very peculiar shape of the type shown in figure 3 of [82] (which might be considered a fine-tuned
shape, necessary to account for Solar System constraints). On the other hand, the corresponding
$\nu$-function family is:

$$\nu_n(y) = \left[\frac{1 + (1 + 4y^{-n})^{1/2}}{2}\right]^{1/n}. \tag{50}$$
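
As a quick numerical illustration of Eq. 49 and Eq. 50, the sketch below implements both families and compares how fast the $n = 1$ and $n = 2$ members approach the Newtonian regime; the function names are illustrative.

```python
import math

def mu_n(x, n):
    """n-family of mu-functions (Eq. 49); n = 1 is 'simple', n = 2 is 'standard'."""
    return x / (1.0 + x ** n) ** (1.0 / n)

def nu_n(y, n):
    """Corresponding n-family of nu-functions (Eq. 50)."""
    return (0.5 * (1.0 + math.sqrt(1.0 + 4.0 * y ** (-n)))) ** (1.0 / n)

if __name__ == "__main__":
    for x in (0.1, 1.0, 10.0):
        # Fractional departure from Newtonian dynamics, 1 - mu, for n = 1 and n = 2.
        print(x, 1.0 - mu_n(x, 1), 1.0 - mu_n(x, 2))
```

The residual $1 - \mu_n(x)$ falls off as $x^{-n}/n$ at large $x$, which is why the standard function is much closer to Newtonian dynamics than the simple one at accelerations of a few $a_0$.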



As the simple $\mu$-function ($\alpha = 1$ or $n = 1$) fits galaxy rotation curves well (see also Sect. 6.5.1)
but is excluded in the Solar System (see also Sect. 6.4), it can be useful to define $\mu$-functions that
have a gradual transition similar to the simple function in the low to intermediate gravity regime
of galaxies, but a more rapid transition towards 1 than the simple function. Two such families are
described in [326] in terms of their $\nu$-function:

$$\nu_\beta(y) = (1 - e^{-y})^{-1/2} + \beta e^{-y} \tag{51}$$

and

$$\nu_\gamma(y) = (1 - e^{-y^{\gamma/2}})^{-1/\gamma} + (1 - \gamma^{-1})e^{-y^{\gamma/2}}. \tag{52}$$

Finally, yet another family was suggested in [275], obtained by deleting the second term of the
$\gamma$-family, and retaining the virtues of the $n$-family in galaxies, but approaching 1 more quickly in
the Solar system:

$$\nu_\delta(y) = (1 - e^{-y^{\delta/2}})^{-1/\delta}. \tag{53}$$
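
The exponential families can be explored in the same way. The sketch below implements the $\gamma$- and $\delta$-families as written in Eq. 52 and Eq. 53 (the function names are illustrative) and can be used to verify their limits numerically.

```python
import math

def nu_gamma(y, gamma):
    """gamma-family of nu-functions (Eq. 52)."""
    e = math.exp(-y ** (gamma / 2.0))
    return (1.0 - e) ** (-1.0 / gamma) + (1.0 - 1.0 / gamma) * e

def nu_delta(y, delta):
    """delta-family of nu-functions (Eq. 53): the gamma-family without its second term."""
    return (1.0 - math.exp(-y ** (delta / 2.0))) ** (-1.0 / delta)
```

For $\gamma = \delta = 1$ the two coincide; for small $y$ both tend to $y^{-1/2}$ (the deep-MOND limit), while for large $y$ the departure from 1 is exponentially suppressed rather than a power law, which is what makes these families attractive for Solar System constraints.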

To be complete, it should be noted that other $\mu$-functions considered in the literature include [305,
506] (see also Sect. 7.10):

m(2;) - ^ , (54) 

and 

$$\mu(x) = 1 - (1 + x/3)^{-3}. \tag{55}$$

This simply shows the variety of shapes that the interpolating function of MOND can in principle
take. Very precise data for rotation curves, including negligible errors on the distance and on
the stellar mass-to-light ratios (or, in that case, purely gaseous galaxies) should make it possible to pin down
its precise form, at least in the intermediate gravity regime and for "modified inertia" theories
(Sect. 6.1.1) where Milgrom's law is exact for circular orbits. Nowadays, galaxy data still allow
some, but not much, wiggle room: they tend to favor the $\alpha = n = 1$ simple function [167] or some
interpolation between $n = 1$ and $n = 2$ [142], while combined data of galaxies and the Solar System



''^Note that, among the freedom of choice of that function, one could additionally even imagine that the /i- function 
is not a scalar function but a "tensor" ynj such that the modification becomes anisotropic and the modified Poisson 
equation becomes something like = 47rGp 
(see Sects. 6.4 and 6.5) rather tend to favor something like the $\gamma = \delta = 1$ function of Eq. 52 and
Eq. 53 (which effectively interpolates between $n = 1$ and $n = 2$, see Figure 19), although slightly
higher exponents (i.e., $\gamma > 1$ or $\delta > 1$) might still be needed in the weak gravity regime in order
to pass Solar system tests involving the external field from the Galaxy [63]. Again, it should be
stressed that the most salient aspect of MOND is however not its precise interpolating function,
but rather its successful predictions on galactic scaling relations and Kepler-like laws of galactic
dynamics (Sect. 5.2), as well as its various beneficial effects on, e.g., disk stability (see Sect. 6.5),
all predicted from its asymptotic form. The very concept of a pre-defined interpolating function
should even in principle fully disappear once a more profound parent theory of MOND is discovered
(see also, e.g., [22]).







Figure 19: Various /x-functions. Dotted green line: the a = "Bekenstein" function of Eq. 35] 
Dashed red line: the a = n = 1 "simple" function of Eq. HHl and Eq. HHl Dot-dashed black line: 
the $n = 2$ "standard" function of Eq. 49. Solid blue line: the $\gamma = \delta = 1$ $\mu$-function corresponding to
the $\nu$-function defined in Eq. 52 and Eq. 53. The latter function closely retains the virtues of the
$n = 1$ simple function in galaxies ($x \lesssim 10$), but approaches 1 much more quickly and connects
with the $n = 2$ standard function as $x \gtrsim 10$.

To end this section on the interpolating function, let us stress that if the $\mu$-function asymptotes
as $\mu(x) = x$ for $x \to 0$, then the energy of the gravitational field surrounding a massive body is
infinite [39]. What is more, if the $\tilde\mu$-function of relativistic multifield theories asymptotes in the same
way to zero before going to negative values for time-evolution dominated systems (see Sect. 9.1),
then a singular surface exists around each galaxy, on which the scalar degree of freedom does not
propagate, and can therefore not provide a consistent picture of collapsed matter embedded into a
cosmological background. A simple solution [146, 381] consists in assuming a modified asymptotic
behavior of the $\mu$-function, namely of the form

$$\mu(x) \sim \varepsilon_0 + x \;\; {\rm for} \;\; x \ll 1. \tag{56}$$

In that case there is a return to a Newtonian behavior (but with a very strong renormalized
gravitational constant $G_N/\varepsilon_0$) at a very low acceleration scale $x \ll \varepsilon_0$, and rotation curves of
galaxies are only approximately flat until the galactocentric radius

$$R \sim \frac{1}{\varepsilon_0}\sqrt{\frac{GM}{a_0}}. \tag{57}$$

One must thus have $\varepsilon_0 \ll 1$ to not affect the observed phenomenology in galaxies. Note that the
$\mu$-function will thus never go to zero, also at the center of a system. Conversely, in QUMOND and
the likes, one can modify the $\nu$-function in the same way:

$$\nu(y) \sim \frac{1}{\varepsilon_0 + y^{1/2}} \;\; {\rm for} \;\; y \ll 1. \tag{58}$$
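
The effect of the floor $\varepsilon_0$ on a rotation curve can be illustrated with a few lines of code. The sketch below solves Milgrom's law with $\mu(x) = \varepsilon_0 + x$ (Eq. 56) for the true acceleration; this form is only the low-acceleration asymptote, so the result is meaningful only for $g \ll a_0$, and the function names are illustrative.

```python
import math

def acceleration_with_floor(g_newton, a0, eps0):
    """Solve (eps0 + g/a0) * g = g_N for g, valid only in the regime g << a0.

    This is the positive root of g^2/a0 + eps0*g - g_N = 0, written in a
    form that avoids cancellation when g_N/a0 << eps0^2.
    """
    disc = math.sqrt(eps0 * eps0 + 4.0 * g_newton / a0)
    return 2.0 * g_newton / (eps0 + disc)

def transition_radius(mass, G, a0, eps0):
    """Radius of Eq. 57 beyond which the rotation curve is no longer flat."""
    return math.sqrt(G * mass / a0) / eps0
```

Well inside the transition radius the solution follows the deep-MOND value $\sqrt{g_Na_0}$, and well outside it tends to $g_N/\varepsilon_0$, i.e., Newtonian gravity with the renormalized constant $G_N/\varepsilon_0$.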

6.3 The external field effect

The above return to a rescaled Newtonian behavior at very large radii and in the central parts 
of isolated systems, in order to avoid theoretical problems with the interpolating function, would 
happen anyway, even with the interpolating function going to zero, for any non-isolated system 
in the Universe (and this return to a Newtonian behavior could actually happen at much lower 
radii) because of a very peculiar aspect of MOND: the external field effect, which appeared in its 
full significance already in the pristine formulation of MOND [294].

In practice, no objects are truly isolated in the Universe and this has wider and more subtle
implications in MOND than in Newton-Einstein gravity. In the linear Newtonian dynamics, the
internal dynamics of a subsystem (a cluster in a galaxy, or a galaxy in a galaxy cluster, for
instance) in the field of its mother system decouples. Namely, the internal dynamics is always the
same independently of any external field (constant across the subsystem) in which the system is
embedded (of course, if the external field varies across the subsystem, it manifests itself as tides).
This has subsequently been built in as a fundamental principle of GR: the Strong Equivalence
Principle (see also Sect. 7). But MOND has to break this fundamental principle of GR. This is
because, as it is an acceleration-based theory, what counts is the total gravitational acceleration
with respect to a pre-defined frame (e.g., the CMB frame). The MOND effects are thus only
observed in systems where the absolute values of the gravity, both internal, $g$, and external, $g_e$ (from
a host galaxy, or astrophysical system, or large scale structure), are less than $a_0$. If $g_e < g < a_0$
then we have standard MOND effects. However, if the hierarchy goes as $g < a_0 < g_e$, then the
system is purely Newtonian, and if $g < g_e < a_0$ then the system is Newtonian with a renormalised
gravitational constant. Ultimately, whenever $g$ falls below $g_e$ (which always happens at
some point) the gravitational attraction falls again as $1/r^2$. This is most easily illustrated in a
thought experiment where one considers MOND effects in one dimension. In Eq. [17] one has
$\nabla\Phi = g + g_e$ and $4\pi G\rho = \nabla.(g_N + g_{Ne})$, which in one dimension leads to the following revised
Milgrom's law (Eq. [7]) including the external field:



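$$g\,\mu\!\left(\frac{g + g_e}{a_0}\right) + g_e \left[\mu\!\left(\frac{g + g_e}{a_0}\right) - \mu\!\left(\frac{g_e}{a_0}\right)\right] = g_N, \qquad (59)$$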



It is interesting to note that different MOND theories offer (very) different answers to the generic question
"acceleration with respect to what?". For instance, in the MOND-from-vacuum idea (see [305] and Sect. 7.10), the
total acceleration is measured with respect to the quantum vacuum, which is well defined. In BIMOND (Sect. 7.8)
it is the relative acceleration between the two metrics, which is also well defined through the difference of Christoffel
symbols.

A Cavendish experiment in a freely falling satellite in Earth orbit would thus return a Newtonian result in
MOND.

such that, when $g \rightarrow 0$, we have Newtonian gravity with a renormalized gravitational constant
$G_{\rm norm} \approx G/[\mu_e(1 + L_e)]$ where $\mu_e = \mu(g_e/a_0)$ and $L_e = (d\ln\mu/d\ln x)_{x=g_e/a_0}$, assuming as before
that the external field only varies on a much larger scale than the internal system. Similarly, for
QUMOND (Eq. [30]) in one dimension, one gets the equivalent of Eq. [10]:



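$$g = g_N\,\nu\!\left(\frac{g_N + g_{Ne}}{a_0}\right) + g_{Ne}\left[\nu\!\left(\frac{g_N + g_{Ne}}{a_0}\right) - \nu\!\left(\frac{g_{Ne}}{a_0}\right)\right]. \qquad (60)$$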



When dealing in the future with very extended rotation curves whose last observed point is in the
extreme weak-field limit, it could be interesting, as a first-order approximation, to use the latter
formulae, adding the external field as an additional parameter of the MOND fit to the external
parts of the rotation curve. This would of course only be a first-order approximation because this
would neglect the three-dimensional nature of the problem and the direction of the external field.
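As a rough numerical illustration of the one-dimensional relation of Eq. (59), the following Python sketch solves for the internal gravity $g$ by bisection and compares the result with the closed-form root that follows for the "simple" function $\mu(x) = x/(1+x)$. The function names, the bisection bracket, and the sample values of $g_N$ and $g_e$ are illustrative assumptions, not part of the original analysis.

```python
import math

A0 = 1.2e-10  # m s^-2, the value of a_0 quoted in this review


def mu_simple(x):
    """'Simple' interpolating function mu(x) = x / (1 + x)."""
    return x / (1.0 + x)


def efe_residual(g, g_newton, g_ext, a0=A0):
    """Left-hand side minus right-hand side of the 1D relation, Eq. (59)."""
    mu_tot = mu_simple((g + g_ext) / a0)
    mu_ext = mu_simple(g_ext / a0)
    return g * mu_tot + g_ext * (mu_tot - mu_ext) - g_newton


def solve_internal_gravity(g_newton, g_ext, a0=A0, iterations=200):
    """Solve Eq. (59) for g by bisection.

    For the simple function the residual equals -g_newton at g = 0 and is
    positive at g = g_newton + 2 * a0, so this bracket contains the root
    (the bracket is an illustrative choice).
    """
    lo, hi = 0.0, g_newton + 2.0 * a0
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if efe_residual(mid, g_newton, g_ext, a0) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def closed_form_simple(g_newton, g_ext, a0=A0):
    """Positive root of Eq. (59) for mu(x) = x / (1 + x)."""
    s = a0 + g_ext
    b = 2.0 * g_ext * a0 + g_ext**2 - g_newton * a0 - g_newton * g_ext
    return (-b + math.sqrt(b * b + 4.0 * g_newton * s**3)) / (2.0 * s)


if __name__ == "__main__":
    g_ext = 1.5 * A0  # illustrative external field, in units of a_0
    for g_newton in (1e-3 * A0, 1e-1 * A0, 10.0 * A0):
        g = solve_internal_gravity(g_newton, g_ext)
        print(f"g_N/a0 = {g_newton / A0:8.3f}   g/g_N = {g / g_newton:.4f}")
```

In the limit $g_N \ll g_e$ the ratio $g/g_N$ printed by this sketch tends to $1/[\mu_e(1 + L_e)]$, the renormalization factor of the gravitational constant in this one-dimensional picture.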

Now, in three dimensions, the problem can be analytically solved only in the extreme case of
the completely external-field dominated part of the system (where $g \ll g_e$) by considering the
perturbation generated by a body of low mass $m$ inside a uniform external field, assumed along
the $z$-direction, ${\bf g}_e = g_e {\bf 1}_z$. Eq. [17] can then be linearized and solved with the boundary condition
that the total field equals the external one at infinity [39] to yield:

$$\Phi(x, y, z) = -\frac{Gm}{\mu_e \tilde{r}}, \qquad (61)$$

with

$$\tilde{r} = r\,[1 + L_e (x^2 + y^2)/r^2]^{1/2}, \qquad (62)$$

squashing the isopotentials along the external field direction. This is thus the asymptotic behavior
of the gravitational field in any system embedded in a constant external field. Similarly, in
QUMOND (Eq. [30]), one gets

$$\Phi(x, y, z) = -\frac{G m \nu_e}{\tilde{r}}, \qquad (63)$$

with

$$\tilde{r} = r/[1 + (L_{Ne}/2)(x^2 + y^2)/r^2], \qquad (64)$$

where $\nu_e = \nu(g_{Ne}/a_0)$ and $L_{Ne} = (d\ln\nu/d\ln y)_{y=g_{Ne}/a_0}$.
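The asymptotic potentials of Eqs. (61)-(64) can be checked numerically. The Python sketch below evaluates the effective radii and verifies by finite differences that the Bekenstein-Milgrom form satisfies, away from the origin, the operator $\partial_x^2 + \partial_y^2 + (1 + L_e)\partial_z^2$. That operator is an assumption here: it is what one obtains when linearizing Eq. (17) about a uniform field along $z$, and it is not quoted from the text. The default values $\mu_e = 0.6$ and $L_e = 0.4$ correspond to the simple function $\mu(x) = x/(1+x)$ at $g_e = 1.5\,a_0$ and are purely illustrative, as are the function names.

```python
import math


def r_tilde_bm(x, y, z, L_e):
    """Effective radius of Eq. (62): r * [1 + L_e (x^2 + y^2) / r^2]^(1/2)."""
    r2 = x * x + y * y + z * z
    return math.sqrt(r2 + L_e * (x * x + y * y))


def phi_bm(x, y, z, Gm=1.0, mu_e=0.6, L_e=0.4):
    """Asymptotic Bekenstein-Milgrom potential of Eq. (61), with G*m = 1 by default."""
    return -Gm / (mu_e * r_tilde_bm(x, y, z, L_e))


def r_tilde_qumond(x, y, z, L_Ne):
    """Effective radius of Eq. (64): r / [1 + (L_Ne / 2)(x^2 + y^2) / r^2]."""
    r2 = x * x + y * y + z * z
    return math.sqrt(r2) / (1.0 + 0.5 * L_Ne * (x * x + y * y) / r2)


def linearized_bm_operator(phi, x, y, z, L_e, h=1e-3):
    """Central-difference estimate of [d2/dx2 + d2/dy2 + (1 + L_e) d2/dz2] phi."""
    centre = phi(x, y, z)
    d2x = (phi(x + h, y, z) - 2.0 * centre + phi(x - h, y, z)) / h**2
    d2y = (phi(x, y + h, z) - 2.0 * centre + phi(x, y - h, z)) / h**2
    d2z = (phi(x, y, z + h) - 2.0 * centre + phi(x, y, z - h)) / h**2
    return d2x + d2y + (1.0 + L_e) * d2z


if __name__ == "__main__":
    L_e = 0.4
    residual = linearized_bm_operator(
        lambda x, y, z: phi_bm(x, y, z, 1.0, 0.6, L_e), 1.0, 0.5, 0.7, L_e
    )
    print("operator residual away from the origin:", residual)
    print("r_tilde on the z-axis at r = 2:", r_tilde_bm(0.0, 0.0, 2.0, L_e))
    print("r_tilde in the plane at r = 2:  ", r_tilde_bm(2.0, 0.0, 0.0, L_e))
```

With these expressions $\tilde{r}$ equals $r$ on the axis of the external field and $r\,(1 + L_e)^{1/2}$ in the perpendicular plane, which quantifies the anisotropy of the isopotentials.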

For the exact behavior of the MOND gravitational field in the regime where $g$ and $g_e$ are of
the same order of magnitude, one again resorts to a numerical solver, both for the BM equation
case and for the QUMOND case (see Eq. [25] and Eq. [35]). For the BM case, one adds the three
components of the external field (no longer assumed to be in the $z$-direction only) in the argument
of $\mu_{M_1}$, which becomes $\{[(\Phi(B) - \Phi(A))/h - g_{ex}]^2 + [(\Phi(I) + \Phi(H) - \Phi(K) - \Phi(J))/(4h) - g_{ey}]^2 +
[(\Phi(C) + \Phi(D) - \Phi(E) - \Phi(F))/(4h) - g_{ez}]^2\}^{1/2}$, and similarly for the other $M_i$ and the $L_i$ points
on the grid (Figure [17]). One also adds the respective component of the external field to the
term estimating the force at the $M_i$ and $L_i$ points in Eq. [25]. With $M_1$ for instance, one changes
$(\Phi_{i+1,j,k} - \Phi_{i,j,k}) \rightarrow (\Phi_{i+1,j,k} - \Phi_{i,j,k} - h g_{ex})$ in the first term of Eq. [25]. One then solves this
discretized equation with the large radius boundary condition for the Dirichlet problem given by
Eq. [61] instead of Eq. [20]. Exactly the same is applicable to calculating the phantom dark matter
component of QUMOND with Eq. [35], except that now the Newtonian external field is added to
the terms of the equation in exactly the same way.

This external field effect (EFE) is a remarkable property of MONDian theories, and because this 
breaks the strong equivalence principle, it allows us to derive properties of the gravitational field 



For instance, using the "simple" function $\mu(x) = x/(1 + x)$ in Eq. [59] would lead to
$g = [g_N(a_0 + g_e) - g_e(2a_0 + g_e) + \sqrt{(2 g_e a_0 + g_e^2 - g_N a_0 - g_N g_e)^2 + 4 g_N (a_0 + g_e)^3}]/[2(a_0 + g_e)]$.

in which a system is embedded from its internal dynamics (and not only from tides). For instance,
the return to a Newtonian (Eq. [61] or Eq. [63]) instead of a logarithmic (Eq. [20]) potential at large
radii is what defines the escape speed in MOND. By observationally estimating the escape speed
from a system (e.g., the Milky Way escape speed from our local neighbourhood, see discussion in
Sect. 6.5.2), one can estimate the amplitude of the external field in which the system is embedded,
and by measuring the shape of its isopotential contours at large radii, one can determine the
direction of that external field, without resorting to tidal effects. It is also noteworthy that the
phantom dark matter has a tendency to become negative in "conoidal" regions perpendicular to
the external field direction (see figure 3 of [491]): with accurate enough weak-lensing data, detecting
these pockets of negative phantom densities could in principle be a smoking gun for MOND [491],
but such an effect would be extremely sensitive to the detailed distribution of the baryonic matter.
A final important remark about the EFE is that it prevents most possible MOND effects in Galactic
disk open clusters or in wide binaries, apart from a possible rescaling of the gravitational constant.
Indeed, for wide binaries located in the Solar neighbourhood, the galactic EFE (coming from
the distribution of mass in our Galaxy) is about $1.5 \times a_0$. The corresponding rescaling of the
gravitational constant then depends on the choice of the $\mu$-function, but could typically account
for up to a 50% increase of the effective gravitational constant. Although this is not properly
speaking a MOND effect, it could still perhaps imply a systematic offset of mass for very long
period binaries. However, any effect of the type claimed to be observed by [189] would not be a
priori expected in MOND due to the external field effect.
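To give a feeling for the size of this rescaling, the Python sketch below evaluates the one-dimensional renormalization factor $G_{\rm norm}/G = 1/[\mu_e(1 + L_e)]$ for an external field of $1.5\,a_0$. The choice of the simple function $\mu(x) = x/(1+x)$, the numerical logarithmic derivative, and the function names are assumptions made for illustration. The one-dimensional expression is only a crude guide, since in three dimensions the effective rescaling also depends on the direction relative to the external field.

```python
import math


def mu_simple(x):
    """'Simple' interpolating function mu(x) = x / (1 + x)."""
    return x / (1.0 + x)


def log_slope(mu, x, eps=1e-6):
    """Numerical logarithmic derivative d ln(mu) / d ln(x) by central difference."""
    up, down = x * (1.0 + eps), x * (1.0 - eps)
    return (math.log(mu(up)) - math.log(mu(down))) / (math.log(up) - math.log(down))


def g_norm_over_g_1d(mu, x_ext):
    """One-dimensional rescaling G_norm / G = 1 / [mu_e (1 + L_e)]."""
    mu_e = mu(x_ext)
    return 1.0 / (mu_e * (1.0 + log_slope(mu, x_ext)))


if __name__ == "__main__":
    x_ext = 1.5  # external field in units of a_0
    print("mu_e             =", mu_simple(x_ext))
    print("L_e              =", log_slope(mu_simple, x_ext))
    print("G_norm / G (1D)  =", g_norm_over_g_1d(mu_simple, x_ext))
    print("1 / mu_e         =", 1.0 / mu_simple(x_ext))
```

For the simple function at $x = 1.5$ one has $\mu_e = 0.6$ and $L_e = 0.4$, so the one-dimensional factor is $1/0.84 \approx 1.19$, while $1/\mu_e \approx 1.67$.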

6.4 MOND in the solar system 

The primary place to test modified gravity theories is of course the Solar System, where General 
Relativity has until now passed all the proposed tests. Detecting a deviation from Einsteinian 
gravity in our backyard would actually be the holy grail of modified gravity theories, in the same 
sense as direct detection in the lab is the holy grail of the CDM paradigm. However, MOND 
anomalies typically manifest themselves only in the weak-gravity regime, several orders of magnitude
below the typical gravitational field exerted by the Sun on, e.g., the inner planets. But
in the case of modified inertia (Sect. 6.1.1), the anomalous acceleration at any location depends
on properties of the whole orbit (non-locality), so that anomalies may appear in the motion of
Solar system bodies that are on highly eccentric trajectories taking them to large distances (e.g.,
long period comets or the Pioneer spacecraft), where accelerations are low [315]. Such MOND
effects have been proposed as a possible mechanism for generating the Pioneer anomaly [315, 470],
without affecting the motions of planets, whose orbits are fully in the high acceleration regime.
On the other hand, in classical, non-relativistic modified gravity theories (Sects. 6.1.2 and 6.1.3),
small effects could still be observable and would primarily probe two aspects of the theory: (i) the
shape of the interpolating function (Sect. 6.2) in the regime $x \gg 1$, and (ii) the external Galactic
gravitational field (Sect. 6.3) acting on the Solar system, testing the interpolating function in the
regime $x \ll 1$.

If, as a first approximation, one considers the Solar system as isolated, and the Sun as a point
mass, the MOND effect in the inner Solar system appears as an anomalous acceleration field in
addition to the Newtonian one. In units of $a_0$, the amplitude of the anomalous acceleration is
given by $x[1 - \mu(x)]$, which can be constrained from the motion of the inner planets, typically their
perihelion precession and the (non)-variation of Kepler's constant [294, 391, 418]. These constraints
typically exclude the whole $\alpha$-family of interpolating functions (Eq. [46]) that are natural for multi-field
theories such as TeVeS (see Sect. 6.2 and Sect. 7) because they yield $x[1 - \mu(x)] > 1$ for
$x \gg 1$ while it must be smaller than 0.04 at the orbit of Mars [391]. This of course does not

See also [204] and constraints excluding such functions also from Lunar Laser Ranging [139], neglecting the
external field effect from the Sun on the Earth-Moon system since it is 3 orders of magnitude below the internal

mean that the $\mu$-function cannot be represented by the $\alpha$-family in the intermediate gravity regime
characterizing galaxies, but it must be modified in the strong gravity regime. Another potential
effect of MOND is anomalously strong tidal stresses in the vicinity of saddle points of the Newtonian
potential, which might be tested with the LISA pathfinder [38] [50] [2561 HIS] • The MOND bubble 
is typically quite big and clearly detectable, but some fine-tuned interpolating functions could still 
make the effect rather small and unobservable I256| 1162] . 
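The inner Solar system constraint quoted above can be illustrated with a short Python sketch that evaluates the anomalous acceleration $x[1 - \mu(x)]$, in units of $a_0$, at the orbit of Mars. The Sun is treated as an isolated point mass and the orbit as circular, as in the first approximation described in the text. The exponential form `mu_exponential` is an assumed example of a sharper transition (Eq. (52) is not reproduced in this section), and the physical constants and function names are likewise supplied here for illustration.

```python
import math

A0 = 1.2e-10               # m s^-2, a_0 as quoted in this review
GM_SUN = 1.32712440018e20  # m^3 s^-2, heliocentric gravitational parameter
AU = 1.495978707e11        # m
MARS_ORBIT_AU = 1.524      # semi-major axis of Mars; circular orbit assumed


def mu_simple(x):
    """'Simple' interpolating function mu(x) = x / (1 + x)."""
    return x / (1.0 + x)


def mu_exponential(x):
    """Assumed sharper transition, mu(x) = 1 - exp(-sqrt(x)) (illustrative form)."""
    return -math.expm1(-math.sqrt(x))


def anomalous_acceleration(mu, x):
    """Anomalous acceleration x [1 - mu(x)], in units of a_0."""
    return x * (1.0 - mu(x))


def x_at_mars():
    """Newtonian solar gravity at the orbit of Mars, in units of a_0."""
    radius = MARS_ORBIT_AU * AU
    return GM_SUN / radius**2 / A0


if __name__ == "__main__":
    x = x_at_mars()
    print(f"x at Mars        = {x:.3e}")
    print(f"simple function  : {anomalous_acceleration(mu_simple, x):.6f}")
    print(f"exponential form : {anomalous_acceleration(mu_exponential, x):.3e}")
```

For the simple function the anomalous acceleration tends to one (in units of $a_0$) at large $x$, well above the bound of 0.04, whereas an exponentially converging $\mu$ makes it negligible in this regime.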

The approximation of an isolated Solar system being incorrect, it is also important to add the
effect of the external field from the Galaxy. Its amplitude is typically of the order of $\approx 1.5 \times a_0$.
From there, Milgrom [315] has predicted (both analytically and numerically) a subtle anomaly
in the form of a quadrupole field that may be detected in planetary and spacecraft motions (as
subsequently confirmed by [63, 186]). This has been used to constrain the form of the interpolating
function in the weak acceleration regime characteristic of the external field itself. Constraints have
essentially been set on the $n$-family of $\mu$-functions from the perihelion precession of Saturn [64, 155],
namely that one must have $n > 8$ in order to fit these data.

It should however be noted that it is slightly inconsistent to compare the classical predictions
of MOND with observational constraints obtained by a global fit of Solar System orbits using a
fully relativistic first-post-Newtonian model. Although the above constraints on classical MOND
models are useful guides, proper constraints can thus only truly be set on the various relativistic
theories presented in Sect. 7, the first order constraints on these theories coming from their own
post-Newtonian parameters [66, 100, 174, 373, 391, 451]. What is more, and this makes all these tests
perhaps unnecessary, it has recently been shown that it is possible to cancel any deviation from
General Relativity at small distances in most of these relativistic theories, independently of the
form of the $\mu$-function [22].

6.5 MOND in rotationally supported stellar systems 
6.5.1 Rotation curves of disk galaxies

The root and heart of MOND, as modified inertia or modified gravity, is Milgrom's formula (Eq. [7]).
Up to some small corrections outside of symmetrical situations, this formula yields (once $a_0$ and
the form of the transition function $\mu$ are chosen) a unique prediction for the total effective gravity
as a function of the gravity produced by the visible baryons. It is absolutely remarkable that this
formula, devised 30 years ago, has been able to successfully predict an impressive number of galactic
scaling relations (the "Kepler-like" laws of Sect. 5.2, backed by the modern data of Sect. 4.3) that
were very imprecise and/or unobserved at the time, and which still are a puzzle to understand in
the $\Lambda$CDM framework. What is more, this formula not only predicts global scaling relations
successfully: we show in this section that it also predicts the shape and amplitude of galactic rotation
curves at all radii with uncanny precision, and this for all disk galaxy Hubble types [169, 400]. Of
course, the exact prediction of MOND depends on the exact formulation of MOND (as
modified inertia or some form or other of modified gravity), but the differences are small compared
to observational error bars, and even compared with the differences between various $\mu$-functions.

In order to illustrate this, we plot in Figure [20] the theoretical rotation curve of an HSB exponential
disk (see [146] for exact parameters) computed with three different formulations of MOND:
Milgrom's formula (Eq. [7]), representative of circular orbits in modified inertia, AQUAL (Eq. [17]),

gravity of the system. 

This is why, although the "simple" $\alpha = 1$ function is known to very well represent the gravitational field of
spiral galaxies [142, 167, 400, 509], we hereafter, in Sect. 6.5.1, rather use the $\gamma = \delta = 1$ function of Eq. [52] and
Eq. [53] in order to fit spiral galaxy rotation curves.

For the $\gamma$-family of Eq. [52] and $\delta$-family of Eq. [53] it means that even slightly sharper transitions than $\gamma = 1$
and $\delta = 1$ might still be needed.

Note that the rotation curves of Figure [20] become flat only at larger radii than shown here, see [146].

Figure 20: Comparison of theoretical rotation curves for the inner parts (before the rotation curve
flattens) of an HSB exponential disk [146], computed with three different formulations of MOND.
Green: Milgrom's formula; Blue: Bekenstein-Milgrom MOND (AQUAL); Red: TeVeS-like multi-field
theory.




Figure 21: Examples of detailed MOND rotation curve fits of the HSB and LSB galaxies of Figure [13]
(NGC 6946 on the left and NGC 1560 on the right). The black line represents the Newtonian
contribution of stars and gas as determined by numerical solution of the Newtonian Poisson equation
for the observed light distribution, as per Figure [13]. The blue line is the MOND fit with the
$\gamma = \delta = 1$ function of Eq. [52] and Eq. [53], the only free parameter being the stellar mass-to-light
ratio. In the $K$-band, the best fit value is $0.37\,M_\odot/L_\odot$ for NGC 6946 and $0.18\,M_\odot/L_\odot$ for NGC
1560. In practice, the best fit mass-to-light ratio can co-vary with the distance to the galaxy and
$a_0$; here $a_0$ is held fixed ($1.2 \times 10^{-10}$ m s$^{-2}$) and the distance has been held fixed to the best
observed value (5.9 Mpc for NGC 6946 and 3.45 Mpc for NGC 1560 f^lD]). Milgrom's formula
provides an effective mapping between the rotation curve predicted by the observed baryons and
the observed rotation, including the bumps and wiggles.
Figure 22: The rotation curve [125] and MOND fit [ SSi\ of the Local Group spiral M33 assuming a
constant stellar mass-to-light ratio (top panel) . While the overall shape is a good match, there is a 
slight mismatch at ~ 3 kpc and above 7 kpc. The observed color gradient implies a slight variation 
in the mass-to-light ratio, in the sense that the stars at small radii are slightly redder and heavier 
than those at large radii. Applying stellar population models [43] to the observed color gradient 
produces a slight adjustment of the Newtonian mass model. The dotted line in the lower panel 
reiterates the constant M/L model from the top panel while the solid line has been corrected for 
the observed color gradient. This slight adjustment to the baryonic mass distribution considerably 
improves the fit.
Figure 23: Residuals of MOND fits to the rotation curves of 78 nearby galaxies (all data to which
the authors have access) including about two thousand individual resolved measurements. Data for 21
galaxies are either new or improved in terms of spatial resolution and velocity accuracy over those
in [399]. More accurate points are illustrated with larger symbols. The histogram of residuals is
plotted on the right panel, and is well fitted by a Gaussian of width $\Delta v/v \approx 0.04$. The bulk of the
more accurate data are in good accord with MOND. There are a few deviant points, mostly at small 
radii where non-circular motions are ubiquitous and observational resolution (beam smearing) can 
be a challenge. These are but a few trees outlying from a very clear forest.
and a multi-field theory (Eq. [40]) representative of a whole class of relativistic theories (see Sects. 7.1
to 7.4), all with the $\alpha = n = 1$ "simple" $\mu$-function of Eq. [46] and Eq. [49]. One can see velocity differences
of only a few percent in this case, while, in general, it has been shown that the maximum
difference between formulations is of the order of 10% for any type of disk [TTJ. This justifies using 
Milgrom's formula as a proxy for MOND predictions on rotation curves, keeping in mind that, in 
order to constrain MOND within the modified gravity framework, one should actually calculate 
predictions of the various modified Poisson formulations of Sect. 6.1 for each galaxy model, and 
for each choice of galaxy parameters [15].
Figure 24: Examples of MOND fits (blue lines, using Eq. [53] with $\delta = 1$) to two massive galaxies [400].
With baryonic masses in excess of $10^{11}\,M_\odot$, these are among the most massive, rapidly
rotating disk galaxies known. Stars dominate the mass, and Newtonian dynamics suffices to explain
the innermost regions because of the high acceleration, but the mass discrepancy becomes
apparent as the Keplerian decline (black lines) falls well below the data at the enormous radii 
spanned by these giant disks (the diameter of UGC 2487 spans half a million light-years). 

The procedure is then the following (see also Sect. 4.3.4 for more details). One usually assumes 
that light traces stellar mass (constant mass-to-light ratio, but see hereafter the counter-example 
M33), and one adds to this baryonic density the contribution of observed neutral hydrogen, scaled 
up to account for the contribution of primordial helium. The Newtonian gravitational force of 
baryons is then calculated via the Newtonian Poisson equation, and the MOND force is simply
Figure 25: Examples of MOND fits (blue lines) to two dwarf galaxies [323]. The data for DDO 210
come from [50], and those for UGC 11583 (also known as KK98 250) are from [3T] augmented with
high resolution data from [281, 243]. The high gas content of these galaxies makes them strong
tests of MOND, as the one fit-parameter - the mass-to-light ratio of the stars - has only a minor
impact on the fit. What is more, as they are deep in the MOND regime, the exact form of the
interpolating function (Sect. 6.2) also has little impact on the fits, making them the cleanest tests
of MOND, with essentially no wiggle room. Note that, with a mass of only a few million solar
masses (comparable in mass to the largest globular clusters), the Local Group dwarf DDO 210 is
the smallest galaxy known to show clear rotation ($V_f \approx 15$ km/s). It is the lowest point in Figure [3].
Figure 26: MOND rotation curve fits for representative galaxies from the THINGS survey [1221
11671 1482) . Galaxies are chosen to illustrate a broad range of mass, from Mi, ^ 3 x 10^ Mq to 
^ 3 X 10"'^^ Mq. All galaxies have high resolution interferometric 21 cm data for the gas and 3.6/i 
photometry for mapping the stars. The Newtonian baryonic mass model is shown as a black line
and the MOND fit as a blue line (as in Figure [21]). The fits use the interpolating function of Eq. [53]
with $\delta = 1$.
Figure 27: MOND rotation curve fits for low surface brightness galaxies [121] updated with high
resolution H$\alpha$ data [243, 242] and using Eq. [53] with $\delta = 1$. Low surface brightness galaxies
are important tests of MOND because their low surface densities ($\Sigma \ll a_0/G$) place them well
into the MOND regime everywhere, and the exact form of the interpolating function is rather
unimportant. Their baryonic mass models fall well short of explaining the observed rotation at
any but the smallest radii in Newtonian dynamics, and MOND nevertheless provides the necessary
additional force everywhere (lines as per Figure [21]).
Figure 28: A comparison of the mass-to-light ratios obtained from MOND rotation curve fits
(points) with the independent expectations of stellar population synthesis models (lines) [43]. The
mass-to-light ratio in the optical (blue $B$-band, left) and near-infrared (2.2 $\mu$m $K$-band, right) are
shown as a function of $B - V$ color (the ratio of blue to green light). The one free parameter of
MOND rotation curve fits reproduces the normalization, slope, and scatter expected from what
we know about stars. Not all galaxies illustrated here have both $B$ and $K$-band data. Some have
neither, instead having photometry in some other bandpass (e.g., $V$ or $R$ or $I$).



obtained via Eq. [7] or Eq. [10]. First of all, an interpolating function must be chosen, then one
can determine the value of $a_0$ by fitting, all at once, a sample of high-quality rotation curves with
small distance uncertainties and no obvious non-circular motions. Then all individual rotation
curve fits can be performed with the mass-to-light ratio of the disk as the single free parameter
of the fit. It turns out that using the simple interpolating function ($\alpha = n = 1$, see Eqs. [46]
and [49]) yields a value of $a_0 = 1.2 \times 10^{-10}$ m s$^{-2}$, and excellent fits to galaxy rotation curves [167].
However, as already pointed out in Sects. 6.3 and 6.4, this interpolating function yields too strong
a modification in the Solar System, so we hereafter rather use the $\gamma = \delta = 1$ interpolating function
of Eqs. [52] and [53] (solid blue line on Figure [19]), very similar to the simple interpolating function
in the intermediate to weak gravity regime.
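The mapping from the Newtonian baryonic rotation curve to the MOND one can be sketched in a few lines of Python. The example below uses the algebraic form $g = \nu(g_N/a_0)\,g_N$ with the $\nu$-function equivalent to the simple $\mu(x) = x/(1+x)$, namely $\nu(y) = 1/2 + \sqrt{1/4 + 1/y}$, rather than the $\gamma = \delta = 1$ function adopted for the fits in this section. The point-mass test case, the unit conversions, and all function names are assumptions made for illustration, not the fitting code used for the figures.

```python
import math

A0 = 1.2e-10      # m s^-2
KPC = 3.0857e19   # m
G = 6.674e-11     # m^3 kg^-1 s^-2
M_SUN = 1.989e30  # kg


def nu_simple(y):
    """nu-function equivalent to the simple mu(x) = x / (1 + x)."""
    return 0.5 + math.sqrt(0.25 + 1.0 / y)


def newtonian_velocity(v_star, v_gas, mass_to_light):
    """Newtonian baryonic velocity in km/s.

    v_star is the stellar contribution for a mass-to-light ratio of one;
    v_gas is assumed to include the helium correction already.
    """
    return math.sqrt(mass_to_light * v_star**2 + v_gas**2)


def mond_velocity(v_newton, radius_kpc, a0=A0):
    """Apply g = nu(g_N / a0) * g_N to a Newtonian circular velocity (km/s)."""
    radius_m = radius_kpc * KPC
    g_newton = (v_newton * 1e3) ** 2 / radius_m
    g = nu_simple(g_newton / a0) * g_newton
    return math.sqrt(g * radius_m) / 1e3


def point_mass_velocity(mass_msun, radius_kpc):
    """Keplerian circular velocity (km/s) around a point mass."""
    return math.sqrt(G * mass_msun * M_SUN / (radius_kpc * KPC)) / 1e3


if __name__ == "__main__":
    mass = 1e10  # solar masses, illustrative
    for radius in (1.0, 10.0, 100.0):
        v_n = point_mass_velocity(mass, radius)
        print(f"R = {radius:6.1f} kpc  V_N = {v_n:6.1f}  V_MOND = {mond_velocity(v_n, radius):6.1f} km/s")
```

In a fit, `mass_to_light` would be the single parameter adjusted for each galaxy, with $a_0$ held fixed. For a point mass the MOND velocity tends at large radii to the flat value $(G M a_0)^{1/4}$.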

Figure [21] shows two examples of detailed MOND fits to rotation curves of Figure [13]. The
black line represents the Newtonian contribution of stars and gas and the blue line is the MOND
fit, the only free parameter being the stellar mass-to-light ratio. Not only does MOND predict
the general trend for low surface brightness (LSB) and high surface brightness (HSB) galaxies,
it also predicts the observed rotation curves in great detail. This procedure has been carried
out for 78 nearby galaxies (all galaxy rotation curves to which the authors have access), and the
residuals between the observed and predicted velocities, at every point in all these galaxies (thus
about two thousand individual measurements), are plotted in Figure [23]. As an illustration of the
variety and richness of rotation curves fitted by MOND, as well as of the range of magnitude of
the discrepancies covered, we display in Figure [24] fits to rotation curves of extremely massive
HSB early-type disk galaxies [400] with $V_f$ up to 400 km/s, and in Figure [25] fits to very low
mass LSB galaxies [325] with $V_f$ down to 15 km/s. In the latter, gas-rich, small galaxies, the
detailed fits are insensitive to the exact form of the interpolating function (Sect. 6.2) and to the
stellar mass-to-light ratio [169, 325]. We then display in Figure [26] eight fits for representative

If one assumes that a lot of dark baryons are present in the form of molecular gas, one can add another free
parameter in the form of a factor multiplying the gas mass [461]. Good MOND fits can then still be obtained but
with a lower value of $a_0$.

The mass-to-light ratio is not really a constant in galaxies, however. Figure [22] thus gives an example of a
rotation curve fit (to the Local Group galaxy M33), where the variation of the mass-to-light ratio according to the
color-gradient has been included, even improving the MOND fit.

galaxies from the latest high-resolution THINGS survey [167, 482], and in Figure [27] six fits of
yet other LSB galaxies (as these provide strong tests of MOND and depend less on the exact
form of the interpolating function than HSB ones) from [121], updated with high resolution H$\alpha$
data [243, 242]. The overall results for the whole sample of 78 nearby galaxies (Figure [23]) are globally
very impressive, although there are a few outliers among the 2000 measurements. These are
but a few trees outlying from a very clear forest. It is actually only as the quality of the data
declines [384] that one begins to notice small disparities. These are sometimes attributable to
external disturbances that invalidate the assumption of equilibrium [401], non-circular motions or
poor observational resolution. For targets that are intrinsically difficult to observe, minor problems
become more common [121, 449]. These typically have to do with the challenges inherent in
combining disparate astronomical data sets (e.g., rotation curves measured independently at optical
and radio wavelengths) and constraining the inclinations. A single individual galaxy that can be
considered as a bit problematic is NGC 3198 [69, 167], but this could simply be due to a problem
with the potentially too high Cepheids-based distance (reddening problem mentioned in [255]).
Indeed, the adopted distance plays an important role in the MOND fitting procedure, as the
value of the centripetal acceleration $V^2/R$ depends on the distance through the conversion of the
observed angular radius in arcsec into the physical radius $R$ in kpc. Note that other galaxies such
as NGC 2841 had historically posed problems to MOND but that these have largely gone away
with modern data (see [167) and Figure !^ . 
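The role of the adopted distance can be made explicit with a small Python sketch: the observed velocity is distance independent, whereas the physical radius scales linearly with distance, so the inferred centripetal acceleration $V^2/R$ scales as the inverse of the distance. The numerical values below are arbitrary illustrative inputs, not measurements of any galaxy discussed here, and the function names are hypothetical.

```python
import math

KPC = 3.0857e19                      # m
ARCSEC = math.pi / (180.0 * 3600.0)  # radians per arcsecond


def physical_radius_kpc(theta_arcsec, distance_mpc):
    """Convert an angular radius (arcsec) into a physical radius (kpc)."""
    return distance_mpc * 1e3 * theta_arcsec * ARCSEC


def centripetal_acceleration(v_kms, theta_arcsec, distance_mpc):
    """Centripetal acceleration V^2 / R in m s^-2 for an adopted distance."""
    radius_m = physical_radius_kpc(theta_arcsec, distance_mpc) * KPC
    return (v_kms * 1e3) ** 2 / radius_m


if __name__ == "__main__":
    v_kms, theta_arcsec = 150.0, 300.0  # illustrative rotation velocity and angular radius
    for distance_mpc in (10.0, 14.0):   # two illustrative adopted distances
        radius = physical_radius_kpc(theta_arcsec, distance_mpc)
        accel = centripetal_acceleration(v_kms, theta_arcsec, distance_mpc)
        print(f"D = {distance_mpc:5.1f} Mpc  R = {radius:6.2f} kpc  V^2/R = {accel:.3e} m s^-2")
```

A larger adopted distance therefore pushes every point of the rotation curve to lower acceleration, which changes where the galaxy sits relative to $a_0$ and hence the quality of the fit.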

We finally note that what makes all these rotation curve fits really impressive is that either (i)
stellar mass-to-light ratios are unimportant (in the case of gas-rich galaxies), yielding excellent fits
with essentially zero free parameters (apart from some wiggle room on the distance), or (ii) stellar
mass-to-light ratios are important, and their best-fit value, obtained on purely dynamical grounds
assuming MOND, varies with galaxy color as one would expect on purely astrophysical grounds
from stellar population synthesis models [43]. There is absolutely nothing built into MOND that
would require that redder galaxies should have higher stellar mass-to-light ratios in the $B$-band,
but this is what the rotation curve fits require. This is shown on Figure [28] where the best-fit
mass-to-light ratio in the $B$-band is plotted against $B - V$ color index (left panel), and the same
for the $K$-band (right panel).

6.5.2 The Milky Way 

Our own Milky Way galaxy (a HSB galaxy) is a unique laboratory within which present and 
future surveys will allow us to perform many precision tests of MOND (at a level of precision that 
might even discriminate between the various versions of MOND described in Sect. 6.1) that are not 
feasible with external galaxies. Concerning the rotation curve however, the test is at present not the 
most conclusive, as the outer rotation curve of the Milky Way is paradoxically much less precisely 
known than that of external galaxies (the forthcoming Gaia mission should improve this
situation, although the rotation curve will not be measured directly). Nevertheless, past studies of
the inner rotation curve of the Milky Way [142, 143, 275], measured with the tangent point method,
compared to the baryonic content of the inner Galaxy [53, 156], have shown full agreement between
the rotation curve and MOND, assuming as usual the simple interpolating function ($\alpha = n = 1$ in
Eqs. [46] and [49]) or the $\gamma = \delta = 1$ interpolating function (Eqs. [52] and [53]). The inverse problem
was also tackled, i.e., deriving the surface density of the inner Milky Way disk from its rotation
curve (see Figure [29]): this exercise [275] led to a derived surface density fully consistent with star
count data, and also even reproducing the details of bumps and wiggles in the surface brightness
(Renzo's rule, Sect. 4.3.4), while being fully consistent with the (somewhat imprecise) constraints
on the outer rotation curve of the Galaxy [495].

However, especially with the advent of present and future astrometric and spectroscopic surveys, 
the Milky Way offers a unique opportunity to test many other predictions of MOND. These include
Figure 29: The mass distribution of the Milky Way disk (left) inferred from fitting in MOND the
observed bumps and wiggles in the rotation curve of the Galaxy (right) [275]. The Newtonian
contributions of the stellar and gas disk are shown as dashed and dotted lines as per Figure [T51 
The resulting model is consistent with independent star count data [156] and compares favorably
to constraints on the rotation curve at radii beyond those included in the fit [495]. The prominent
feature at $R \approx 6$ kpc corresponds to the Centaurus spiral arm.



the effect of the "phantom dark disk" (see Figure [T5]) on vertical velocity dispersions and on the tilt 
of the stellar velocity ellipsoid, the precise shape of tidal streams around the Galaxy, or the effects 
of the external gravitational field in which the Milky Way is embedded on fundamental parameters 
such as the local escape speed. All these predictions can however slightly vary depending on 
the exact formulation of MOND (mainly Bekenstein-Milgrom MOND, QUMOND, or multi-field
theories, the predictions being anyway difficult to make in modified inertia versions of MOND
when non-circular orbits are considered). Most of the predictions made until today and reviewed
hereafter have been using the Bekenstein-Milgrom version of MOND (Eq. [17]).

Based on the baryonic distribution from, e.g., the Besançon model of the Milky Way [367], one
can compute the MOND gravitational field of the Galaxy by solving the BM-equation (Eq. [17]).
This has been done in [491]. Then one can apply the Newtonian Poisson equation to it, in order
to recover the density distribution that would have yielded this potential within Newtonian
dynamics [51] 1141) . In this context, as already shown (Figure [TSt . MOND predicts a disk of 
"phantom dark matter" allowing one to clearly differentiate it from a Newtonian model with a 
dark halo: 

(i) By measuring the force perpendicular to the Galactic plane: at the Solar radius, MOND 
predicts a 60 percent enhancement of the dynamical surface density at 1.1 kpc above the plane 
compared to the baryonic surface density, a value in agreement with current data (Table 1,
see also [340]). The enhancement would become more apparent at large galactocentric radii
where the stellar disk mass density becomes negligible. 

(ii) By determining dynamically the scale length of the disk mass density distribution. This scale
length is a factor ~1.25 larger than the scale length of the visible stellar disk if Bekenstein-
Milgrom MOND applies. Such a test could be applied with existing RAVE data [424], but
the accuracy of available proper motions still limits the possibility of exploring the gravitational
forces too far from the Solar neighbourhood.

(iii) By measuring the velocity ellipsoid tilt angle within the meridional galactic plane. This tilt
is different within the MOND and Newton+dark halo cases in the inner part of the Galactic
disk. The tilt of about 6 degrees at z = 1 kpc at the Solar radius is in agreement with the
recent determination of 7.3 ± 1.8 degrees obtained by [423]. The difference between MOND
and a Newtonian model with a spherical halo becomes significant at z = 2 kpc. Interestingly,
recent data [329] on the tilt of the velocity ellipsoid at these heights clearly favor the MOND
prediction [51].
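
The "phantom dark matter" recipe described before the list above (compute the MOND field, then apply the Newtonian Poisson equation to it) can be illustrated with a minimal sketch. This is not the Besançon disk computation of the cited works: it assumes a spherically symmetric point mass, for which Milgrom's formula is exact, the "simple" interpolating function mu(x) = x/(1+x), and an assumed value a0 = 1.2e-10 m/s^2. The function names are hypothetical.

```python
import math

G = 6.674e-11        # gravitational constant [m^3 kg^-1 s^-2]
A0 = 1.2e-10         # MOND acceleration constant a0 [m s^-2] (assumed value)
M_SUN = 1.989e30     # solar mass [kg]
KPC = 3.086e19       # kiloparsec [m]


def g_newton(r, mass):
    """Newtonian acceleration of a point mass at radius r."""
    return G * mass / r**2


def g_mond(r, mass):
    """MOND acceleration from Milgrom's formula g * mu(g/a0) = g_N,
    with the 'simple' function mu(x) = x / (1 + x), solved for g."""
    gn = g_newton(r, mass)
    return 0.5 * gn + math.sqrt(0.25 * gn**2 + gn * A0)


def phantom_density(r, mass, dr_frac=1e-4):
    """Extra density a Newtonian observer would infer from the MOND field:
    rho_ph = (1 / (4 pi G r^2)) * d[r^2 (g_mond - g_newton)]/dr,
    i.e. the spherical Poisson equation applied to the excess acceleration."""
    dr = r * dr_frac

    def excess(x):
        return x**2 * (g_mond(x, mass) - g_newton(x, mass))

    # Central finite difference for the radial derivative.
    d_excess = (excess(r + dr) - excess(r - dr)) / (2.0 * dr)
    return d_excess / (4.0 * math.pi * G * r**2)


if __name__ == "__main__":
    mass = 5e10 * M_SUN  # illustrative baryonic mass, not a fitted value
    for r_kpc in (5, 20, 100):
        rho = phantom_density(r_kpc * KPC, mass)
        print(f"r = {r_kpc:4d} kpc  rho_phantom = {rho:.3e} kg/m^3")
```

In the deep-MOND regime this phantom density tends to sqrt(G M a0) / (4 pi G r^2), the isothermal profile associated with a flat rotation curve. For a flattened baryonic disk the phantom density additionally acquires a disk-like component, which is the effect exploited in tests (i) to (iii).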

Such tests of MOND could be applied with the first release of future Gaia data. To illustrate
the current local constraints, the predictions of the Besançon MOND model are compared
with the relevant observations in Table 1. Let us however note that these predictions are extremely
dependent on the baryonic content of the model [54] [156] [367], so that testing MOND at the
precision available in the Milky Way relies heavily on star counts, stellar population synthesis,
a census of the gaseous content (including molecular gas), and inhomogeneities in the baryonic
distribution (clusters, gas clouds).
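
As a rough worked check of the comparison in Table 1, the offset between each quoted MOND prediction and the corresponding observation can be expressed in units of the observational error. The sketch below uses only the numbers quoted in Table 1 and assumes, for simplicity, that the predictions carry no uncertainty of their own, which the caveat above about the baryonic content shows is optimistic. The helper name is hypothetical.

```python
def tension_sigma(predicted, observed, sigma_obs):
    """Offset between prediction and observation in units of the
    observational 1-sigma error (prediction uncertainty ignored)."""
    return (predicted - observed) / sigma_obs


# Values quoted in Table 1.
surface_density = tension_sigma(78.0, 74.0, 6.0)   # Msun / pc^2 at z = 1.1 kpc
ellipsoid_tilt = tension_sigma(6.0, 7.3, 1.8)      # degrees at z = 1 kpc

print(f"surface density offset: {surface_density:+.2f} sigma")
print(f"ellipsoid tilt offset:  {ellipsoid_tilt:+.2f} sigma")
```

Both offsets are below one sigma, which is why the current local data are compatible with the MOND prediction without yet being discriminating.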

Another test of the predictions of MOND for the gravitational potential of the Milky Way 
is the thickness of the HI layer as a function of position in the disk (see also Sect. 6.5.3): it has
been found [379] that Bekenstein-Milgrom MOND and its phantom disk successfully account for
the most recent and accurate measurements of the flaring of the HI layer beyond 17 kpc from the center, but that it
slightly underpredicts the scale-height in the region between 10 and 15 kpc. This could indicate 
that the local stellar surface density in this region should be slightly smaller than usually assumed, 
in order for MOND to predict a less massive phantom disk and hence a thicker HI layer. Another 
explanation for this discrepancy would rely on non-gravitational phenomena, namely ordered and 
small-scale magnetic fields and cosmic rays contributing to support the disk. 

Yet another test would be the comparison of the observed Sagittarius stream [199] [249] with
the predictions made for a disrupting galaxy satellite in the MOND potential of the Milky Way.
Basic comparisons of the stream with the orbit of a point mass have shown agreement at zeroth
order [359]. In reality, such an analysis is not straightforward because streams do not delineate
orbits, and because of the non-linearity of MOND. By combining a MOND N-body code with a
Bayesian technique [475] in order to efficiently explore the parameter space, it should however be
possible to rigorously test MOND with such data in the near future, including for external galaxies,
which will thus lead to an exciting battery of new observational tests of MOND.

Finally, a last test of MOND in the Milky Way involves the external field effect of Sect. 6.3. 
As explained there, the return to a Newtonian (Eq. 61 or Eq. 63) instead of a logarithmic (Eq. 20)
potential at large radii defines the escape speed in MOND. By observationally estimating the
escape speed from a system (e.g., the Milky Way escape speed from our local neighbourhood), one
can estimate the amplitude of the external field in which the system is embedded. With simple
analytical arguments, it was found [145] that with an external field of 0.01 a0, the local escape
speed at the Sun's radius was about 550 km/s, exactly as observed (within the observational error
range [434]). This was later confirmed by rigorous modeling in the context of Bekenstein-Milgrom
MOND and with the Besançon baryonic model of the Milky Way [493]. This value of the external
field, 10⁻² × a0, corresponds to the order of magnitude of the gravitational field exerted by Large
Scale Structure, estimated from the acceleration endured by the Local Group during a Hubble 
time in order to attain a peculiar velocity of 600 km/s. 
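
The order-of-magnitude estimate in the last sentence can be reproduced with a few lines of arithmetic. The sketch below assumes a constant acceleration, a Hubble time of about 14 Gyr, and a0 = 1.2e-10 m/s^2; these numerical values are assumptions made for illustration and are not taken from the text.

```python
KM = 1.0e3                 # metres per kilometre
GYR = 1.0e9 * 3.156e7      # seconds per gigayear
A0 = 1.2e-10               # MOND acceleration constant a0 [m s^-2] (assumed)


def mean_acceleration(delta_v, duration):
    """Constant acceleration needed to reach delta_v in a given duration."""
    return delta_v / duration


# Local Group peculiar velocity of 600 km/s acquired over ~one Hubble time.
g_ext = mean_acceleration(600.0 * KM, 14.0 * GYR)
ratio = g_ext / A0

print(f"g_ext = {g_ext:.2e} m/s^2 = {ratio:.3f} a0")
```

Under these assumptions the external field comes out at roughly one percent of a0, consistent with the value of 0.01 a0 quoted above.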

6.5.3 Disk stability and interacting galaxies 

Many questions in galaxy dynamics require the use of N-body codes. This is notably necessary
for studying the stability of galaxy disks, the formation of bars and spirals, or highly time-varying
configurations such as galaxy mergers. As we have seen in Sect. 6.1.2, the BM modified Poisson
equation (Eq. 17) can be solved numerically using various methods [51] [78] [97] [148] [251] [458]. Such
Table 1: Values predicted from the Besançon model of the Milky Way in MOND as seen by a
Newtonist (i.e., in terms of phantom dark matter contributions) compared to current observational
constraints in the Milky Way, for the local dynamical surface density and the tilt of the stellar
velocity ellipsoid [51]. Predictions for a round dark halo without a dark disk are also compatible
with the current constraints, though [195] [423]. The tilt at z = 2 kpc should be more discriminating.

                                        MOND predictions    Observations
Surface density Σ⊙(z = 1.1 kpc)         78 M⊙/pc²           74 ± 6 M⊙/pc² [195]
Velocity ellipsoid tilt at z = 1 kpc    6 degrees           7.3 ± 1.8 degrees [423]


a Poisson solver can then be used in particle-mesh N-body codes. More general codes based on 
QUMOND (Sect. 6.1.3) are currently under development.

The main results obtained via these simulations are the following (the comparison with obser- 
vations will be discussed below): 

(i) LSB disks are more unstable to bar and spiral instabilities in MOND than in the
Newton+spherical halo equivalent case,

(ii) Bars always tend to appear more quickly in MOND than in the Newton+spherical halo 
equivalent, and are not slowed down by dynamical friction, leading to fast bars, 

(iii) LSB disks can be both very thin and extended in MOND thanks to the effect of the "phantom 
disk", and vertical velocity dispersions level off at 8 km/s, instead of 2 km/s for Newtonian 
disks, 

(iv) Warps can be created in apparently isolated galaxies from the external field effect of large 
scale structure in MOND, 

(v) Merging time-scales are longer in MOND for interacting galaxies, 

(vi) Reproducing interacting systems such as the Antennae requires relatively fine-tuned initial
conditions in MOND, but the resulting galaxy is more extended and thus closer to observa- 
tions, thanks to the absence of angular momentum transfer to the dark halo. 

Concerning the first point (i), Brada & Milgrom [75] investigated the important problem of the
stability of disk galaxies. They demonstrated that MOND, as anticipated [300], has an effect
similar to a dark halo in stabilizing a rotationally supported disk, thereby explaining the upper
limit in surface density seen in the data (Sect. 4.3.2), and also showed how it damps the
growth-rate of bar-forming modes in the weak gravitational field regime. In a comparison of MOND disks
with the equivalent Newtonian+halo counterpart (with identical rotation curves), they found that,
as the surface density of the disk decreases, the growth-rate of the bar-forming mode decreases
similarly in both cases. However, in the limit of very low surface densities, typical of LSB galaxies,
the MOND growth-rate stops decreasing, contrary to the Newton+dark halo case (Figure 30).
This could provide a solution to the stability challenge of Sect. 4.2, as observed LSBs do exhibit
bars and spirals, which would require an ad hoc dark component within the self-gravitating disk
of the Newtonian system. One can also see in this figure that if the surface density is typical of
intermediate HSB galaxies, the bar systematically forms more quickly in MOND.

This was confirmed in recent simulations [105] [458], where it was additionally found that (ii)
the bar is sustained longer, and is not slowed down by dynamical friction against the dark halo,
which leads to fast bars, consistent with the observed fast bars in disk galaxies (measured through
the position of resonances). When gas inflow and external gas accretion are included, however,
Figure 30: The scaled growth-rate of the m = 2 instability in Newtonian disks with a dark halo
(dotted line) and MONDian disks (solid line) as a function of disk mass. In the MOND case, as
the disk mass decreases, the surface density decreases and the disk sinks deeper into the MOND
regime. At very low masses, however, the growth-rate saturates. In the equivalent Newtonian
case, the rotation curve is maintained at the MOND level by supplementing the force with a round
stabilizing dark halo which causes the growth-rate to crash [78] [399]. An ad-hoc dark disk could
help maintain the growth rate in the dark matter context. 



a wider range of pattern speeds is found in MOND, all compatible with
observations [459]. Since the bar pattern speed has a tendency to stay constant, the resonances
remain at the same positions, and particles are trapped on these orbits more easily than in the
Newtonian case, which leads to the formation of rings and pseudo-rings as observed (see Figure 31
and Figure 32). All these results have been shown to be rather independent of the exact choice of
interpolating μ-function [459].

What is more, (iii) LSB disks can be both very thin and extended in MOND thanks to the 
stabilizing effect of the "phantom disk", and vertical velocity dispersions level off at 8 km/s, as 
typically observed [26] [242], instead of 2 km/s for Newtonian disks with Σ = 1 M⊙ pc⁻² (depending
on the thickness of the disk). However, the observed value is usually attributed to non-gravitational
phenomena. Note that [279] utilized this fact to predict that conventional analyses of LSB disks
would infer abnormally high mass-to-light ratios for their stellar populations - a prediction that
was subsequently confirmed [160] [372]. But let us also note that this stabilizing effect of the
phantom disk, leading to very thin stellar and gaseous layers, could even be too strong in the
region between 10 and 15 kpc from the galactic center in the Milky Way (see Sect. 6.5.2), and
in external galaxies [498], even though, as said, non-gravitational effects such as ordered and
small-scale magnetic fields and cosmic rays could significantly contribute to the prediction in these 
regions. 

Via these simulations, it has also been shown (iv) that the external field effect of MOND
(Sect. 6.3) offers a mechanism other than the relatively weak effect of tides for inducing and
maintaining warps [80]. It was demonstrated that a satellite at the position and with the mass of the
Magellanic clouds can produce a warp in the plane of the Galaxy with the right amplitude and 
form [80] , and even more importantly, that isolated galaxies could be affected by the external field 
of large scale structure, inducing a differential precession over the disk, in turn causing a warp [105] . 
This could provide a new explanation for the puzzle of isolated warped galaxies. 

Interactions and mergers of galaxies are (v) very important in the cosmological context of galaxy 
formation (see also Sect. 9.2). It has been found [HS] from analytical arguments that dynamical 
friction should be much more efficient in MOND, leading for instance to bars slowing down and mergers
occurring more quickly. But simulations display exactly the opposite effect, in the sense of bars not
slowing down and merger time-scales being much longer in MOND [339] [460]. Concerning bars,
Nipoti [336] found that they were indeed slowed down more in MOND, as predicted analytically [96],
but this is because their bars were unrealistically small compared to observed ones. In reality, the
bar takes up a significant fraction of the baryonic mass, and the reservoir of particles to interact
with, assumed infinite in the case of the analytic treatment [96], is in reality insufficient to affect the
bar pattern speed in MOND. Concerning long merging time-scales, an important constraint from
this would be that, in a MONDian cosmology, there should perhaps be fewer mergers, but longer
ones, than in ΛCDM, in order to keep the total observed amount of interacting galaxies unchanged.
This is indeed what is expected (see Sect. 9.2). What is more, the long merging time-scales would 
imply that compact galaxy groups do not evolve statistically over more than a crossing time. In 
contrast, in the Newtonian+dark halo case, the merging time scale would be about one crossing 
time because of dynamical friction, such that compact galaxy groups ought to undergo significant 
merging over a crossing time, contrary to what is observed [240] . Let us also note that, in MOND, 
many passages in binary galaxies will happen before the final merging, with a starburst triggered 
at each passage, meaning that the number of observed starbursts as a function of redshift cannot 
be used as an estimate of the number of mergers [105] . 

Finally, (vi) at a more detailed level, the Antennae system, the prototype of a major merger, 
has been shown to be nicely reproducible in MOND [460]. This is illustrated in Figure 33. In
contrast, while it is well established that CDM models can result in nice tidal tails, it turns out
to be difficult to simultaneously match the narrow morphology of many observed tidal tails with
rotation curves of the systems from which they come [131]. In MOND, reproducing the Antennae
requires relatively fine-tuned initial conditions, but the resulting tidal tails are narrow and the 
galaxy is more extended and thus closer to observations than with CDM, thanks to the absence 
of angular momentum transfer to the dark halo (solution to the angular momentum challenge of 
Sect. 4.2). 

6.5.4 Tidal dwarf galaxies

As seen in, e.g., Figure 33 (left panel), major mergers between spiral galaxies are frequently observed
with dwarf galaxies at the extremity of their tidal tails, called Tidal Dwarf Galaxies (TDGs). These
young objects are formed through gravitational instabilities within the tidal tails, leading to local
collapse of gas and star formation. These objects are very common in interacting systems: in some
cases dozens of such condensations are seen in the tidal tails, with a few having a mass typical
of other dwarf galaxies in the Universe. In the ΛCDM model, however, these objects are difficult
to form, and require very extended dark matter distributions [72]. In MOND simulations [460] [105],
however, the exchange of angular momentum occurs within the disks, whose sizes are inflated. For 
this reason, it is much easier with MOND to form TDGs in extended tidal tails. 

What is more, in the ΛCDM context, these objects are not expected to drag CDM around
them, the reason being that these objects are formed out of the material in the tidal tails, itself 
made of the dynamically cold, rotating, material in the progenitor disk galaxies. In these disks, the 
Figure 31: (a) The galaxy ESO 509-98 (b) The galaxy NGC 1543. These are two examples of
galaxies that exhibit clear ring and pseudo-ring structures. 




Figure 32: Simulations of ESO 509-98 and NGC 1543 in MOND, to be compared with Figure 31.
Rings and pseudo-ring structures are well reproduced with modified gravity (Figure courtesy of
O. Tiret).

Figure 33: Simulation of the Antennae with MOND (right, [460]) compared to the observations
(left, [191]). In the observations, the gas is represented in blue and the stars in green. In the
simulation the gas is in blue and the stars are in yellow/red. (Figure courtesy of O. Tiret)

Figure 34: The NGC 5291 system [73]. VLA atomic hydrogen 21-cm map (blue) superimposed on
an optical image (white). The UV emission observed by GALEX (red) traces dense star-forming
concentrations. The most massive of these objects are rotating with the projected spin axis as
indicated by dashed arrows. The three most massive ones are denoted as NGC5291N, NGC5291S,
and NGC5291W. Figure courtesy of F. Bournaud.

Figure 35: Rotation curves of the three TDGs in the NGC 5291 system. In red: ΛCDM prediction
(with no additional cold molecular gas), with the associated uncertainties. In black: MOND
prediction with the associated uncertainties (prediction with zero free parameters, "simple"
μ-function assumed).

local ratio of dark matter to baryons is close to zero. For this reason, the ΛCDM prediction is that
these objects should not exhibit a mass discrepancy problem. However, the first ever measurement
of the rotation curves of three TDGs in the NGC 5291 ring system (Figure 34) has instead revealed
the presence of dark matter in these three objects [73]. A solution to explain this in the standard
picture could then be to resort to dark baryons in the form of cold molecular gas in the disks of
the progenitor galaxies. However, it is very surprising that a very different kind of dark matter,
in this case baryonic dark matter, would conspire to assemble itself precisely in the right way
such as to put the three TDGs (see Sect. 4.3.1) on the baryonic Tully-Fisher relation (when this
baryonic dark matter is not taken into account in the baryonic budget of the BTF). Another
possibility, not resorting to baryonic dark matter, would be that, by chance, the three TDGs
have been observed precisely edge-on. However, if we simply consider the most natural inclination
coming from the geometry of the ring (i = 45°, see [73]), and apply Milgrom's formula to the
visible matter distribution with zero free parameters [166] [310], one gets very reasonable curves
(Figure 35). Slightly adjusting the inclinations allowed perfect fits to these rotation
curves [166], while the influence of the external field effect has been shown not to significantly
change the result. We can therefore conclude that ΛCDM has severe problems with these objects,
while MOND does exceedingly well in explaining their observed rotation curves.
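
The two ingredients of this argument, the inclination correction and the application of Milgrom's formula to the visible matter, can be sketched as follows. This is a minimal illustration and not the analysis of the cited works: it assumes the "simple" interpolating function, an assumed a0 = 1.2e-10 m/s^2 (about 3703 (km/s)^2/kpc), and a point-mass baryonic distribution in the usage example. The function names are hypothetical.

```python
import math

G = 4.301e-6    # gravitational constant [kpc (km/s)^2 / Msun]
A0 = 3.703e3    # a0 = 1.2e-10 m/s^2 expressed in (km/s)^2 / kpc (assumed value)


def deproject(v_los, inclination_deg):
    """Rotation velocity from the line-of-sight velocity of a disk seen at the
    given inclination (90 degrees = edge-on, which gives the smallest value)."""
    return v_los / math.sin(math.radians(inclination_deg))


def v_mond(v_bar, r_kpc):
    """Rotation velocity predicted by Milgrom's formula from the Newtonian
    baryonic velocity v_bar at radius r, using mu(x) = x / (1 + x)."""
    g_n = v_bar**2 / r_kpc                       # Newtonian baryonic acceleration
    g = 0.5 * g_n + math.sqrt(0.25 * g_n**2 + g_n * A0)
    return math.sqrt(g * r_kpc)


if __name__ == "__main__":
    # Illustrative point mass of 1e9 Msun observed at a few radii.
    mass = 1.0e9
    for r in (1.0, 5.0, 20.0):
        v_bar = math.sqrt(G * mass / r)
        print(f"r = {r:5.1f} kpc  v_bar = {v_bar:6.1f}  v_mond = {v_mond(v_bar, r):6.1f} km/s")
    # The same line-of-sight velocity implies a larger rotation at i = 45 deg.
    print(deproject(50.0, 90.0), deproject(50.0, 45.0))
```

The sketch has no free parameter beyond the baryonic mass distribution and the inclination, which is the sense in which the prediction in Figure 35 is described as having zero free parameters. Far from the mass, v_mond tends to the constant (G M a0)^(1/4).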

However, the observations of only three TDGs are of course not enough, from a statistical point 
of view, in order for this result to be as robust as needed. Many other TDGs should be observed to 
randomize the uncertainties, and consolidate (or invalidate) this potentially extremely important 
result, which could really allow one to discriminate between Milgrom's law being either a consequence
of some fundamental aspect of gravity (or of the nature of dark matter), or simply a mere recipe 
for how CDM organizes itself inside spiral galaxies. As a summary, since the internal dynamics of 
tidal dwarfs should not be affected by CDM, they cannot obey Milgrom's law for a statistically 
significant sample of TDGs if Milgrom's law is only linked to the way CDM assembles itself in 
galaxies. Observations of the internal dynamics of TDGs should thus be one of the observational 
priorities of the coming years in order to settle this debate. 

Finally, let us note that it has been suggested [240], as a possible solution to the satellites
phase-space correlation problem of Sect. 4.2, that most dwarf satellites of the Milky Way could
have been formed tidally, thereby being old tidal dwarf galaxies. They would then naturally appear
in closely related planes, explaining the observed disk-of-satellites. While this scenario would lead
to a missing satellites catastrophe in ΛCDM (see Sect. 4.2), it could actually make sense in a
MONDian Universe (see Sect. 9.2).

6.6 MOND in pressure supported stellar systems 

We have already outlined (Sect. 5.2) how Milgrom's formula accounts for general scaling relations 
of pressure-supported systems such as the Faber-Jackson relation (Figure 7 and see [395]), and
that isothermal systems have a finite mass in MOND with the density at large radii falling
approximately as r⁻⁴ [297]. Note also that, in order to match the observed fundamental plane, MOND
models must actually deviate somewhat from being strictly isothermal and isotropic: a radial orbit
anisotropy in the outer regions is needed [388] [87]. Here we concentrate on slightly more detailed
predictions and scaling relations. In general, these detailed predictions are less obvious to make 
than in rotationally supported systems, precisely because of the new degree of freedom introduced 
by the anisotropy of the velocity distribution, very difficult to constrain observationally (as higher 
order moments than the velocity dispersions would be needed to constrain it). As we shall see, 
the successes of MOND are in general a bit less impressive in pressure-supported systems than 
in rotationally supported ones, and even in some cases really problematic (e.g., in the case of 
galaxy clusters, see Sect. 6.6.4). Whether this is due to the fact that predictions are less obvious 
to make, or whether this truly reflects a breakdown of Milgrom's formula for these objects (or the
fact that certain theoretical versions of MOND would explicitly deviate from Milgrom's formula 
in pressure-supported systems, see Sect. 6.1.1) remains unclear. 

6.6.1 Elliptical galaxies 

Luminous elliptical galaxies are dense bodies of old stars with very little gas and typically large 
internal accelerations. The age of the stellar populations suggests they formed early and all the
gas has been used to form stars. To form early, one might expect the presence of a massive dark
matter halo, but the study of, e.g., [368] showed that there is actually very little evidence for
dark matter within the effective radius, and even several effective radii, in ellipticals. On the
other hand, these are very high surface brightness objects and would thus not be expected to
show a large mass discrepancy within the bright optical object in MOND. And indeed, the results
of [368] were shown to be in perfect agreement with MOND predictions, assuming very reasonable
anisotropy profiles [324]. On the theoretical side, it was also importantly shown that triaxial
elliptical galaxies can be reproduced using the Schwarzschild orbit superposition technique [483],
and that these models are stable [494].

Interestingly, some observational studies circumvented the mass-anisotropy degeneracy by con- 
structing non-parametric models of observed elliptical galaxies, from which equivalent circular 
velocity curves, radial profiles of mass-to-light ratio, and anisotropy profiles as well as high-order 
moments could be computed [172]. Thanks to these studies, it was shown, e.g., [172] that, although
not much dark matter is needed, the equivalent circular velocity curves (see also [485] where the
rotation curve could also be measured directly) tend to become flat at much larger accelerations
than in thin exponential disk galaxies. This would seem to contradict the MOND prescription,
for which flat circular velocities typically occur well below the acceleration threshold a0, but not
at accelerations of the order of a few times a0 as in ellipticals. However, as shown in [364], if
one assumes the simple interpolating function (α = n = 1 in Eq. 46 and Eq. 49), known to yield
excellent fits to spiral galaxy rotation curves (see Sect. 6.5.1), one finds that MONDian galaxies
exhibit a flattening of their circular velocity curve at high accelerations if they can be described
by a Jaffe profile [209] in the region where the circular velocity is constant. Since this flattening at
high accelerations is not possible for exponential profiles, it is thus remarkable that such flattenings
of circular velocity curves at high accelerations are only observed in elliptical galaxies. What is
more, [172], as well as [455], derived from their models scaling relations for the configuration space
and phase-space densities of dark matter in ellipticals, and these DM scaling relations have been
shown [364] to be in very good agreement with the MOND predictions for "phantom DM" (Eq. 33)
scaling relations. This is displayed in Figure 36. Of course, some of these galaxies reside in
clusters, and the external field effect (see Sect. 6.3) could thus modify the predictions, but this was
shown to be negligible for most of the analyzed sample, because the galaxies are far away from the
cluster center [364]. Note that when closer to the center of galaxy clusters, interesting behaviors
such as lopsidedness caused by the external field effect could allow new tests of MOND in the near
future [492]. However, this would require modeling both the orbit of the galaxy in the cluster to
take into account time-variations of the external field, as well as a precise estimate of the external
field from the cluster itself, which can be tricky as the whole cluster should be modeled at once
due to the non-linearity of MOND [114] [260].

Footnote: Separable models have also been investigated in [98].




Figure 36: MOND phantom dark matter scaling relations in ellipticals. The circles display central 
density ρ0 and central phase space density f of the phantom dark halos predicted by MOND
for different masses of baryonic Hernquist profiles (with scale-radius r_H related to the effective
radius by R_eff = 1.815 r_H). The dotted lines are the scaling relations of [172], and the dashed lines
those of [455], which exhibit a very large observational scatter in good agreement with the MOND
prediction [364].

At a more detailed level, precise full line-of-sight velocity dispersion profiles of individual
ellipticals, typically measured with tracers such as PNe or globular cluster populations, have been
reproduced by solving the Jeans equation in spherical symmetry:

$$\frac{d\sigma^2}{dr} + \sigma^2\,\frac{2\beta + \alpha}{r} = -g(r) \qquad (65)$$

where $\sigma$ is the radial velocity dispersion, $\alpha = d\ln\rho/d\ln r$ is the slope of the tracer density $\rho$, and
$\beta = 1 - (\sigma_\theta^2 + \sigma_\phi^2)/2\sigma^2$ is the velocity anisotropy. Note that on the left-hand side, one uses the
density and the velocity dispersion of the tracers only, which can be different from the density 
producing the gravity on the right-hand side if a specific population of tracers such as globular
clusters is used. When the global kinematics of a galaxy is analyzed, we do expect in MOND that
the gravity on the right-hand side of Eq. 65 is generated by the observed mass distribution, so
both should be fit simultaneously: Figure 37 (provided by [404]) shows an example. In general,
it was found that field galaxies are all fit very naturally with MOND [462] [411] (see also [485]).
On the other hand, the MOND modification has been found to slightly underpredict the velocity 
dispersions in large elliptical galaxies at the very center of galaxy clusters [365], which is just the 
small-scale equivalent of the problem of MOND in clusters, pointing towards missing baryons (see 
Sect. 6.6.4). 
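
To make the use of the spherical Jeans equation (Eq. 65) more concrete, the sketch below works out its simplest MOND solution. It is an idealization and not the modelling performed in the cited works: it assumes tracers with a constant logarithmic density slope α and constant anisotropy β, orbiting far from a point mass M in the deep-MOND regime where g(r) = sqrt(G M a0)/r, with an assumed a0 = 1.2e-10 m/s^2. Under these assumptions a constant σ solves Eq. 65, with σ² = sqrt(G M a0) / (−(α + 2β)). The function names are hypothetical.

```python
import math

G = 6.674e-11       # gravitational constant [m^3 kg^-1 s^-2]
A0 = 1.2e-10        # MOND acceleration constant a0 [m s^-2] (assumed value)
M_SUN = 1.989e30    # solar mass [kg]
KPC = 3.086e19      # kiloparsec [m]


def g_deep_mond(r, mass):
    """Deep-MOND acceleration (magnitude) around a point mass."""
    return math.sqrt(G * mass * A0) / r


def sigma_r_deep_mond(mass, alpha, beta):
    """Constant radial velocity dispersion solving Eq. 65 for tracers with
    constant density slope alpha and anisotropy beta in the deep-MOND field."""
    slope = -(alpha + 2.0 * beta)
    if slope <= 0.0:
        raise ValueError("need alpha + 2*beta < 0 for a physical solution")
    return math.sqrt(math.sqrt(G * mass * A0) / slope)


def jeans_residual(sigma2, dsigma2_dr, r, alpha, beta, g):
    """Left-hand side minus right-hand side of Eq. 65; zero for a solution."""
    return dsigma2_dr + sigma2 * (2.0 * beta + alpha) / r + g


if __name__ == "__main__":
    mass = 1.0e11 * M_SUN            # illustrative baryonic mass
    sigma = sigma_r_deep_mond(mass, alpha=-3.0, beta=0.0)
    print(f"sigma_r = {sigma / 1e3:.1f} km/s")
```

Because σ⁴ is proportional to G M a0 in this solution, it reproduces the Faber-Jackson-like scaling mentioned at the start of this section. The dependence on β also shows directly how the unknown anisotropy shifts the mass inferred from a measured dispersion.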
Figure 37: The surface brightness (a) and velocity dispersion (b) profiles of the elliptical galaxy
NGC 7507 [376] fitted by MOND (lines [404]). Elliptical galaxies can be approximated in MOND
as high order polytropes with some radial orbit anisotropy [388]. This particular case has a
polytropic index of 14 with anisotropy of the Osipkov-Merritt form with an anisotropy radius of
5 kpc and maximum anisotropy β = 0.75 at large radii [404]. The stellar mass-to-light ratio is
Υ* = 3.03 M⊙/L⊙. This simple model captures the gross properties of both the surface brightness
and velocity dispersion profiles. The galaxy is well-fitted by MOND, contrary to the claim of [376].

On the other hand, [226] used satellite galaxies of ellipticals to test MOND at distances of
several 100 kpc. They used the stacked SDSS satellites to generate a pair of mock galaxy groups
with reasonably precise line-of-sight velocity dispersions as a function of radius across the group.
When these systems were first analysed by [226], it was claimed that MOND was excluded at 10σ,
but this was only for models that had constant velocity anisotropy. It was then found [14] that
with varying anisotropy profiles similar to those found in simulations of the formation of ellipticals by
dissipationless collapse in MOND [338], excellent fits to the line-of-sight velocity dispersions of both mock
galaxies could be found. These fits can be taken as strong evidence that MOND describes
the dynamics in the surroundings of relatively isolated ellipticals very well.

Finally, let us note an intriguing possibility in a MONDian Universe (see also Sect. 9.2). While 
massive ellipticals would form at z ≈ 10 [393] from monolithic dissipationless collapse [338], dwarf
ellipticals could be more difficult to form. A possibility to form those would then be that tidal
dwarf galaxies would be formed and survive more easily (see Sect. 6.5.4) in major mergers, and
could then evolve to lead to the population of dwarf ellipticals seen today, thereby providing a
natural explanation for the observed density-morphology relation [240] (more dwarf ellipticals in
denser environments).

6.6.2 Dwarf spheroidal galaxies



Dwarf spheroidal (dSph) satellites of the Milky Way [428] [478] exhibit some of the largest mass
discrepancies observed in the Universe. In this sense, they are extremely interesting objects in 
which to test MOND. Observationally, let us note that there are essentially two classes of objects 
in the galactic stellar halo: globular clusters (see Sect. 6.6.3 hereafter) and dSph galaxies. These 
overlap in baryonic mass, but not in surface brightness, nor in age or uniformity of the stellar 
populations. The globular clusters are generally composed of old stellar populations, they are HSB 
objects and mostly exhibit no mass discrepancy problem, as expected for HSB objects in MOND. 
The dSphs, on the contrary, generally contain slightly younger stellar populations covering a range 
of ages, they are extreme LSB objects and exhibit, as said before, an extreme mass discrepancy, 
as generically expected from MOND. So, contrary to the case of ΛCDM where different formation
scenarios have to be invoked (see also Sect. 6.6.3), the different mass discrepancies in these objects 
find a natural explanation in MOND. 

At a more detailed level, MOND should also be able to fit the whole velocity dispersion profiles, 
and not only give the right ballpark prediction. This analysis has recently become possible for the
eight "classical" dSphs around the Milky Way [478]. Solving the Jeans equation (Eq. 65), it was found [5]
that the four most massive and distant dwarf galaxies (Fornax, Sculptor, Leo I and Leo II) have
typical stellar mass-to-light ratios, exactly within the expected range. Assuming equilibrium, two
of the other four (smallest and most nearby) dSphs have mass-to-light ratios that are a bit higher
than expected (Carina and Ursa Minor), and two have very high ones (Sextans and Draco). For all
these dSphs, there is a remarkable correlation between the stellar M/L inferred from MOND and
the ages of their stellar populations [190]. Concerning the high inferred stellar M/L, note that it
has been shown [75] that a dSph will begin to suffer tidal disruption at distances from the Milky
Way that are 4-7 times larger in MOND than in CDM; Sextans and Draco could thus actually be partly
tidally disrupted in MOND. And indeed, after subjecting the five dSphs with published data to an
interloper removal algorithm [419], it was found that Sextans was probably littered with unbound
stars which inflated the computed M/L, while Draco's projected distance-l.o.s. velocity diagram
actually looks as out-of-equilibrium as that of Sextans. Ursa Minor, on the other hand, is the typical
example of an out-of-equilibrium system, elongated and showing evidence of tidal tails. In the end,
only Carina thus has a suspiciously high M/L (> 4, see [419]).

What is more, there is a possibility that, in a MONDian Universe, dSphs are not primordial
objects but have been tidally formed in a major merger (see Sect. 9.2 as a solution to the
phase-space correlation challenge of Sect. 4.2). In addition to the MOND effect, it would then be
possible that these objects never really reach a stable equilibrium [238], and exhibit artificially
high M/L ratios. This is even more true for the recently discovered "ultra-faint" dwarf spheroidals,
which are also, due to their extremely low density, very much prone to tidal heating in MOND. Indeed,
at face value, if these ultra-faints are equilibrium objects, their velocity dispersions are much too
high compared to what MOND predicts, and rule out MOND straightforwardly. However, even if
this is not due to systematic errors linked with the smallness of the velocity dispersion to be measured
(one must distinguish between $\sigma \approx 2\,\mathrm{km\,s^{-1}}$ and $\sigma \approx 5\,\mathrm{km\,s^{-1}}$), and/or to high intrinsic stellar
M/L ratios related to stochastic effects linked with the small number of stars [187], it was also
found [285] that these objects are all close to filling their MONDian tidal radii, and that their
stars can complete only a few orbits for every orbit of the satellite itself around the Milky Way
(see Figure 38). As Brada & Milgrom [79] have shown, it then comes as no surprise that they
are displaying out-of-equilibrium dynamics in MOND (and even more so in the case of a tidal
formation scenario [238]).
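
As a rough illustration of the "ballpark prediction" mentioned above, the following sketch evaluates the deep-MOND velocity dispersion of an isolated, isotropic system, $\sigma^4 = (4/81)\,G M a_0$. This relation is not written out in this section and is assumed here; it ignores the external field effect and any departure from equilibrium, both of which matter for real dSphs. The value of $a_0$, the unit conversions and the example mass are illustrative assumptions, not values taken from the text.

```python
"""Deep-MOND velocity dispersion of an isolated, isotropic system (illustrative sketch)."""

# Assumed constants; none of these values is quoted in the text above.
G = 4.30091e-6            # gravitational constant in kpc (km/s)^2 / Msun
A0_SI = 1.2e-10           # MOND acceleration constant a0 in m/s^2 (assumed value)
KPC_IN_M = 3.0857e19      # metres per kiloparsec
A0 = A0_SI * KPC_IN_M / 1.0e6   # a0 converted to (km/s)^2 / kpc


def deep_mond_sigma(mass_msun):
    """Line-of-sight dispersion in km/s from sigma^4 = (4/81) G M a0."""
    return (4.0 / 81.0 * G * mass_msun * A0) ** 0.25


def mass_from_sigma(sigma_kms):
    """Invert the relation: baryonic mass in Msun implied by a dispersion in km/s."""
    return 81.0 * sigma_kms ** 4 / (4.0 * G * A0)


if __name__ == "__main__":
    example_mass = 1.0e7  # Msun, hypothetical dwarf (not a measured value)
    print(f"sigma = {deep_mond_sigma(example_mass):.1f} km/s")
    # M scales as sigma^4, so confusing 2 km/s with 5 km/s changes the mass by (5/2)^4.
    print(f"mass ratio = {mass_from_sigma(5.0) / mass_from_sigma(2.0):.1f}")
```

Because the inferred mass scales as $\sigma^4$, the difference between 2 and 5 km/s corresponds to a factor of $(5/2)^4 \approx 39$ in mass, which is why small systematic errors on the dispersion matter so much for the ultra-faint dwarfs.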
Figure 38: The characteristic acceleration, in units of $a_0$, in the smallest galaxies known: the
dwarf satellites of the Milky Way (orange squares) and M31 (pink squares) [285]. The so-called
classical dwarfs, with thousands of velocity measurements of individual stars [478], are largely
consistent with MOND. The more recently discovered "ultrafaint" dwarfs, tiny systems with only
a handful of stars [428], typically are not, in the sense that their measured velocity dispersions
and accelerations are too high. This could be due to systematic uncertainties in the data 2Z1\, 
as we must distinguish between $\sigma \sim 2\,\mathrm{km\,s^{-1}}$ and $5\,\mathrm{km\,s^{-1}}$. Nevertheless, there may be a
good physical reason for the non-compliance of the ultrafaint galaxies in the context of MOND. 
The deviation of these objects only occurs in systems where the stars are close to filling their 
MONDian tidal radii: the left panel shows the half light radius relative to the tidal radius. Such 
systems may not be in equilibrium. Brada & Milgrom [79] note that systems will no longer respond
adiabatically to the influence of their host galaxy when a star in a satellite galaxy can complete 
only a few orbits for every orbit the satellite makes about its host. The deviant dwarfs are in this 
regime (right panel). 
6.6.3 Star clusters



Star clusters come in two types: open clusters and globular clusters. Most observed open clusters 
are in the inner parts of the Milky Way disk, and for that reason, the prediction of MOND is that 
their internal dynamics is Newtonian [294] with, perhaps, a slightly renormalized gravitational
constant and slightly squashed isopotentials, due to the external field effect (Sect. 6.3). The 
possibility of distinguishing Newtonian dynamics from MOND in these objects would therefore 
require extreme precision. On the other hand, globular clusters are mostly HSB halo objects 
(see Sect. 6.6.2), and are consequently predicted to be Newtonian, and most of those that are 
fluffy enough to display a MONDian behavior are close enough to the Galactic disk to be affected 
by the external field effect (Sect. 6.3), and so are Newtonian, too. Interestingly, MOND thus 
provides a natural explanation for the dichotomy between dwarf spheroidals and globular clusters. 
In ΛCDM, this dichotomy is rather explained by the formation history [236, 397]: globular clusters
are supposedly formed in primordial disk-bound supermassive molecular clouds with a high
baryon-to-dark-matter ratio, and later become more spheroidal due to subsequent mergers. In MOND, it is
of course not implied that the two classes of objects have necessarily the same formation history, but 
the different dynamics are qualitatively explained by MOND itself, not by the different formation 
scenarios. 
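
To make the distinction between HSB and "fluffy" clusters concrete, the sketch below estimates a characteristic internal acceleration $GM/r^2$ in units of $a_0$. It is only an order-of-magnitude illustration: the value of $a_0$, the unit conversions and the two example clusters are hypothetical inputs chosen for this sketch, and the external field effect discussed above is deliberately left out.

```python
"""Characteristic internal acceleration of a star cluster in units of a0 (sketch)."""

# Assumed constants; not quoted in the text.
G_PC = 4.30091e-3                       # gravitational constant in pc (km/s)^2 / Msun
KMS2_PER_PC_TO_SI = 1.0e6 / 3.0857e16   # converts (km/s)^2 / pc to m/s^2
A0_SI = 1.2e-10                         # MOND acceleration constant in m/s^2 (assumed)


def internal_acceleration_in_a0(mass_msun, radius_pc):
    """Return G M / r^2 divided by a0 for a cluster of given mass and radius."""
    g_kms2_per_pc = G_PC * mass_msun / radius_pc ** 2
    return g_kms2_per_pc * KMS2_PER_PC_TO_SI / A0_SI


def expected_regime(mass_msun, radius_pc):
    """Crude classification that ignores the external field effect."""
    ratio = internal_acceleration_in_a0(mass_msun, radius_pc)
    return "Newtonian" if ratio > 1.0 else "MONDian if isolated"


if __name__ == "__main__":
    # Hypothetical clusters, chosen only to contrast a compact and a diffuse object.
    for label, mass, radius in [("compact", 1.0e6, 3.0), ("fluffy", 1.0e4, 25.0)]:
        ratio = internal_acceleration_in_a0(mass, radius)
        print(f"{label}: g/a0 = {ratio:.3g} -> {expected_regime(mass, radius)}")
```

A compact, massive cluster sits far above $a_0$, while a diffuse, low-mass one falls well below it; whether the latter actually shows MONDian behavior then depends on the external field, which this sketch does not model.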

However, there exist a few globular clusters (roughly, fewer than ~10 out of a total of
150) that are both fluffy enough to display typical internal accelerations well below $a_0$, and far
enough from the Galactic plane to be more or less immune to the external field effect [28, 183,
182, 437]. These should thus in principle display a MONDian mass discrepancy. They include,
e.g., Pal 14 and Pal 3, or the large fluffy globular cluster NGC 2419. Pal 3 is interesting, because it
indeed tends to display a larger-than-Newtonian global velocity dispersion, broadly in agreement
with the MOND prediction (Baumgardt & Kroupa, private communication). However, it is difficult
to draw too strong a conclusion from this (e.g., on excluding Newtonian dynamics), since
not many stars have been observed, and one or two outliers would be sufficient to make the dispersion
grow artificially, while a slightly higher than usual mass-to-light ratio could reconcile Newtonian
dynamics with the data. Other clusters such as NGC 1851 and NGC 1904 apparently display the
same MONDian behavior [409] (see also [188]). On the other hand, Pal 14 displays exactly the
opposite behavior: the measured velocity dispersion is Newtonian [213], but again the number of
observed stars is too small to draw a statistically significant conclusion [165], and it is still possible
to reconcile the data with MOND assuming a slightly low stellar mass-to-light ratio [438]. Note
that if the cluster is on a highly eccentric orbit, the external gravitational field could vary very
rapidly both in amplitude and direction, and it is possible that the cluster could take some time to
accommodate this, still displaying a Newtonian signature in its kinematics after a sudden decrease
of the external field.

NGC 2419 is an interesting case, because it allows not only for a measurement of the global velocity
dispersion, but also of the detailed velocity dispersion profile [200]. And, again, as in the case
of Pal 14 (but contrary to Pal 3), it displays Newtonian behavior. More precisely, it was found,
solving the Jeans equations (Eq. 65), that the best MOND fit, although not extremely bad in itself,
was 350 times less likely than the best Newtonian fit without DM [200, 201]. The stability [337]
of this best MOND fit has, however, not been checked in detail. These results are also heavily
debated, as they rely on the small quoted measurement errors on the surface density, and even a
slight rotation of only the outer parts of this system near the plane of the sky (which would not
show up in the velocity data) would make a considerable difference in the right direction for MOND
[403]. Still, these observations, together with the results on Pal 14, although not ruling out any
theory, are not a resounding success for MOND. They could perhaps indicate that globular
clusters are generically on highly eccentric orbits, and out of equilibrium because of this (however,
the effect would have to be opposite to that prevailing in ultra-faint dwarfs, where the departure
from equilibrium would boost the velocity dispersion instead of decreasing it). A stronger view on
these results could indicate that MOND as formulated today is an incomplete paradigm (see, e.g.,
Eq. E7)) , or that MOND is an effect due to the fundamental nature of the DM fluid in galaxies (see 
Sects. 7.6 and 7.9), which is absent from globular clusters. Concerning NGC 2419, it is however 
perhaps useful to recall that it is very plausibly not a globular cluster. It is part of the Virgo
stream and is thus most probably the remaining nucleus of a disrupting satellite galaxy in the halo
of the Milky Way, on a generically highly eccentric orbit. Detailed N-body simulations of such an
event, and of the internal dynamics of the remaining nucleus, would thus be the key to confronting
MOND with observations in this object. All in all, the situation regarding MOND and the internal
dynamics of globular clusters thus remains unclear. 

On the other hand, it has been noted that MOND seems to overpredict the Roche lobe volume 
of globular clusters |500l I50H 1513] . Again, the fact that globular clusters could generically be 
on highly eccentric orbits could come to the rescue here. What is more, it was shown that, in 
MOND, globular clusters can have a cutoff radius that is unrelated to the tidal radius when
non-isothermal [397]. In general the cutoff radii of dwarf spheroidals, which have comparable baryonic
masses, are larger than those of the globular clusters, meaning that the dwarf spheroidals may well
extend to their tidal radii because of a possibly different formation history than globular clusters.

Finally, a last issue for MOND related to globular clusters [336, 378] is the existence of five
such objects surrounding the Fornax dwarf spheroidal galaxy. Indeed, under similar environmental
conditions, dynamical friction occurs on significantly shorter timescales in MOND than in standard
dynamics [96], which could cause the globular clusters to spiral in and merge within at most
2 Gyr [378]. However, this strongly depends on the orbits of the globular clusters, and in particular
on their initial radii [10], which can allow the orbits to survive for a Hubble time in MOND.

6.6.4 Galaxy groups and clusters 

As pointed out earlier (3rd Kepler-like law of Sect. 5.2), it is a natural consequence of Milgrom's law
that at the effective baryonic radius of the system, the typical acceleration $\sigma^2/R$ is always observed
to be of the order of $a_0$, thereby naturally explaining the linear relation between size and
temperature for galaxy clusters [328, 392]. However, one of the main predictions of Milgrom's formula is
the so-called baryonic Tully-Fisher relation (circular velocity vs. baryonic mass, Figure 3), and
its equivalent for isotropic pressure-supported systems, the Faber-Jackson relation (stellar velocity
dispersion vs. baryonic mass, Figure 7), both for their slope and normalization. For systems such
as galaxy clusters, where the hot intra-cluster gas is the major baryonic component, this relation
can also be translated into a "gas temperature vs. baryonic mass" relation, $M_b \propto T^2$, plotted in
Figure 39 as the line $\log(M_b/M_\odot) = 2\log(T/\mathrm{keV}) + 12.9$ (note that this differs slightly from [55^]
where solar metallicity gas is assumed). Note in this figure that observations are closer to the
MOND-predicted slope than to the conventional prediction of $M \propto T^{3/2}$ in ΛCDM, without the
need to invoke preheating (a need that may arise as an artifact of the mismatch in slopes).
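
The relation quoted above can be evaluated directly. The minimal sketch below uses only the slope and intercept given in the text for the MOND line and the two slopes (2 for MOND, 3/2 for ΛCDM) to compare how strongly the predicted mass varies with temperature; the function names and the example temperatures are illustrative choices, not part of the source.

```python
"""Baryonic mass vs. X-ray temperature: MOND line and slope comparison (sketch)."""
import math

MOND_SLOPE = 2.0        # M_b proportional to T^2
MOND_INTERCEPT = 12.9   # log10(M_b / Msun) at T = 1 keV, as quoted in the text
LCDM_SLOPE = 1.5        # conventional M proportional to T^(3/2)


def mond_baryonic_mass(temperature_kev):
    """Predicted baryonic mass in Msun from log(M_b/Msun) = 2 log(T/keV) + 12.9."""
    return 10.0 ** (MOND_SLOPE * math.log10(temperature_kev) + MOND_INTERCEPT)


def mass_ratio(t_hot_kev, t_cool_kev, slope):
    """Ratio of predicted masses between two temperatures for a power-law slope."""
    return (t_hot_kev / t_cool_kev) ** slope


if __name__ == "__main__":
    for t_kev in (1.0, 4.0, 8.0):  # example temperatures, chosen for illustration
        print(f"T = {t_kev:.0f} keV -> M_b = {mond_baryonic_mass(t_kev):.2e} Msun")
    print("mass ratio 8 keV / 2 keV, MOND slope:", mass_ratio(8.0, 2.0, MOND_SLOPE))
    print("mass ratio 8 keV / 2 keV, LCDM slope:", mass_ratio(8.0, 2.0, LCDM_SLOPE))
```

Over a factor of four in temperature, the two slopes differ by a factor of two in predicted mass (16 versus 8), which is the kind of difference the comparison with the data relies on.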

So, interestingly, the data are still reasonably consistent with the slope predicted by MOND [383],
but not with the normalization. There is roughly a factor of two of residual missing mass in these 
objects [nil [3SS1 IMZl [3M1 IMl ESM- This conclusion, reached from applying the hydrostatic 
equilibrium equation to the temperature profile of the X-ray emitting gas of these objects, has 
also been reached for low mass X-ray emitting groups [12]. This is essentially because, contrary to 
the case of galaxies, there is observationally a need for "Newtonian" missing mass in the central 
parts of clusters, where the observed acceleration is usually slightly larger than $a_0$, meaning that
the MOND prescription is not enough to explain the observed discrepancy between visible and 

''^The conventional baryon fraction of clusters increases monotonically with radiusQJJ, only obtaining the cosmic 
value of 0.17 at or beyond the virial radius. One might therefore infer the presence of dark baryons in cluster cores 
in ΛCDM as well as in MOND.
Figure 39: The baryonic mass-X-ray temperature relation for rich clusters (gray triangles [360,
389]) and groups of galaxies (green triangles [12]). The solid line indicates the a priori prediction
of MOND: the data are reasonably consistent with the slope ($M \propto T^2$), but not with the
normalization. This is the residual missing baryon problem in MOND: there should be roughly twice
as much mass (on average) as observed. Also shown is the scaling relation a priori expected in
ΛCDM (dashed line [138]). This is in better (if not perfect) agreement with the normalization of
the data for rich clusters, but not the slope. The difference is sometimes attributed to preheating
of the gas [497], which might also occur in MOND.

dynamical mass there. For this reason, the residual missing mass in MOND is essentially
concentrated in the central parts of clusters, where the ratio of MOND dynamical mass to observed
baryonic mass reaches a value of 10, and then decreases to a value of roughly 2 in the very
outer parts, where almost no residual mass is present. The profile of this residual mass would thus
consist of a large constant density core of about 100-200 kpc in size (depending on the size of the
group/cluster in question), followed by a sharp cutoff.

The need for this residual missing mass in MOND might be taken in one of the five following 
ways: 

(i) Practical falsification of MOND, 

(ii) Evidence for missing baryons in the central parts of clusters, 

(iii) Evidence for non-baryonic dark matter (existing or exotic), 

(iv) Evidence that MOND is an incomplete paradigm, 

(v) Evidence for the effect of additional fields in the parent relativistic theories of MOND, not 
included in Milgrom's formula. 

If (i) is correct, one still needs to explain the success of MOND on galaxy scales with ΛCDM.
Such an explanation has yet to be offered. Thus, tempting as case (i) is, it is worth inspecting
the four other possibilities more closely.

The second case (ii) would be most in line with the elegant absence of need for any non-baryonic 
mass in MOND (however, see the "dark fields" invoked in Sect. 7). It has happened before that 
Figure 40: The baryon budget in the low redshift Universe adopted from [422]. The census of
baryons includes the detected Warm-Hot Intergalactic Medium (WHIM), the Lyman-α forest, stars
in galaxies, detected cold gas in galaxies (atomic HI and molecular H2), other gas associated with 
galaxies (the Circumgalactic Medium, CGM), and the Intracluster Medium (ICM) of groups and 
clusters of galaxies. The sum of known baryons falls short of the density of baryons expected from 
Big Bang Nucleosynthesis: ~ 30% are missing. These missing baryons presumably exist in some as 
yet undetected (i.e., dark) form. If a fraction of these dark baryons reside in clusters (an amount 
roughly comparable to that in the ICM) it would suffice to explain the residual mass discrepancy 
problem MOND suffers in galaxy clusters. 

most of the baryonic mass was in an unobserved component. From the 1930s, when Zwicky first
discovered the missing mass problem in clusters, until the 1980s, it was widely presumed that the
stars in the observed galaxies represented the bulk of baryonic mass in clusters. Only after the
introduction of MOND (in 1983) did it become widely appreciated that the diffuse X-ray emitting
intracluster gas (the ICM) greatly outweighed the stars. That is to say, some of the missing mass
problem in clusters was due to optically dark baryons: instead of the enormous mass discrepancies
implied by cluster dynamical mass to optical light ratios in excess of 100 [25], the ratio of dark to
baryonic mass is only ~8 conventionally [176, 286]. So we should not be too hasty in presuming
we now have a complete census of baryons in clusters. Indeed, in the global baryon inventory of
the Universe, ~30% of the baryons produced during big bang nucleosynthesis (BBN) are missing
(Figure 40), and presumably reside in some as yet undetected (dark) form. It is estimated [161, 422]
that the observed baryons in clusters only account for about 4% of those produced during BBN
(Figure 40). This is much less than the 30% of baryons that are still missing. Consequently, only a
modest fraction of the dark baryons need to reside in clusters to solve the problem of missing mass
in the central regions of clusters in MOND. It should be highlighted that this missing mass only
appears in MOND for systems with a high abundance of ionised gas and X-ray emission. Indeed,
for even smaller galaxy groups, devoid of gas, the MOND predictions for the velocity dispersions of
individual galaxies are again perfectly in line with the observations [304, 308]. It is then no stretch
of the imagination to surmise that these gas-rich systems, where the residual missing baryons
problem appears, contain equal quantities of molecular hydrogen or other molecules. Milgrom [311]
has, e.g., proposed that the missing mass in MOND could entirely be in the form of cold, dense gas clouds.
There is an extensive literature discussing searches for cold gas in the cores of galaxy clusters,
but what is usually meant there is quite different from what is meant here, since those searches
consisted in trying to find the signature of diffuse cold molecular gas at a temperature of ~30 K.
The proposition of Milgrom [311] rather relies on the work of Pfenniger & Combes [353], where
dense gas clouds with a temperature of only a few Kelvin (~3 K), Solar System size, and of a
Jupiter mass, were considered to be possible candidates for both galactic and extragalactic dark
matter. These clouds would behave in a collisionless way, just like stars. Since the dark mass
considered in the context of MOND cannot be present in galaxies, it is however not subject to the
galactic constraints on such gas clouds. Note that the total sky covering factor of such clouds in
the core of the clusters would be of the order of only 10~^, so that they would only occult a minor 
fraction of the X-rays emitted by the hot gas (and it would be a rather constant fraction). For the
same reason, the chances of a given quasar having its light absorbed by them are very small. Still, [311]
notes that these clouds could be probed through X-ray flashes coming out of individual collisions
between them. Of course, this speculative idea also raises a number of questions, the most serious
one being how these clumps form and stabilize, and why they form only in clusters, X-ray emitting
groups and some ellipticals at the center of these groups and clusters, but not in individual spiral
galaxies. As noted above, the fact that missing mass in MOND is necessarily associated with an
abundance of ionised gas could be a hint at a formation and stabilization process somehow linked
with the presence of hot gas and X-ray emission themselves. Then, there is the question of
whether the formation of the clouds would precede or follow cluster formation. We note that a
rather late formation mechanism could help increase the metal abundance, solving the problem of
small-scale variations of metallicity in clusters when the clouds are destroyed [331]. Milgrom [311]
also noted that these clouds could alleviate the cooling flow conundrum, because whatever destroys
them (e.g., cloud-cloud collisions and dynamical friction between the clouds and the hot gas) is
conducive to heating the core gas, and thus preventing it from cooling too quickly. Such a heating
source would not be transient and would be quite isotropic, contrary to AGN heating.
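
The baryon-budget argument made earlier in this paragraph can be written out with the rounded numbers quoted there. As a rough sketch, assume that the residual mass needed by MOND in clusters is comparable to the baryonic mass already observed in them (the factor-of-two discrepancy noted above); the symbols $f_{\rm cl}$, $f_{\rm miss}$ and $f_{\rm need}$ are introduced here only for illustration.

```latex
\begin{align}
f_{\rm cl}   &\approx 0.04 && \text{(observed cluster baryons, as a fraction of BBN baryons)} \\
f_{\rm miss} &\approx 0.30 && \text{(fraction of BBN baryons still unaccounted for)} \\
f_{\rm need} &\approx \frac{f_{\rm cl}}{f_{\rm miss}} \approx \frac{0.04}{0.30} \approx 0.13
             && \text{(share of the missing baryons that would have to sit in clusters)}
\end{align}
```

Under these assumptions, only about 13% of the globally missing baryons would need to reside in clusters, which is the sense in which a "modest fraction" suffices.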

Another possibility (iii) would be that this residual missing mass in clusters is in the form of
non-baryonic matter. There is one obviously existing form of such matter: neutrinos. If
$m_\nu \approx \sqrt{\Delta m^2}$ [435], then the neutrino mass is too small to be of interest in this context. But there is
nothing that prevents it from being larger (note that the "cosmological" constraints from structure
formation in the ΛCDM context obviously do not apply in MOND). Actual model-independent
experimental limits on the electron neutrino mass from the Mainz/Troitsk experiments, counting
the highest energy electrons in the β-decay of tritium [235], are $m_\nu < 2.2$ eV. Interestingly, the
KATRIN experiment (the KArlsruhe TRItium Neutrino experiment, under construction) will be
able to falsify these 2 eV electron neutrinos at 95% confidence. If the neutrino mass is substantially
larger than the mass differences, then all types have about the same mass, and the cosmological
density of three left-handed neutrinos and their antiparticles [392] would be

$$\Omega_\nu = 0.062\, m_\nu, \tag{66}$$

where $m_\nu$ is the mass of a single neutrino type in eV. If one assumes that clusters of galaxies
respect the baryon-neutrino cosmological ratio, and that the MOND missing mass is mostly made
of neutrinos as suggested by [389, 392], then the mass of neutrinos must indeed be around 2 eV.
Combined with the effect of additional degrees of freedom in relativistic MOND theories (Sect. 7),
it has been shown that the CMB anisotropies could also be reproduced (see Sect. 9.2 and [431]),
while this hot dark matter would obviously free-stream out of spiral galaxies and would thus not
perturb the MOND fits of Sect. 6.5.1. The main limit on the neutrino ability to condense in
clusters comes from the Tremaine-Gunn limit [464], stating that the phase space density must be
preserved during collapse. This is a density level half the quantum mechanical degeneracy level in
phase-space:



Converting this into configuration space, the maximum density for a cluster of a given temperature, 
T, is defined for a given mass of one neutrino type as [464]:



Assuming the temperature of the neutrino fluid to be equal (due to violent relaxation) to the
mean emission-weighted temperature of the gas, Sanders [389] showed that such 2 eV neutrinos
at the limit of experimental detection could indeed account for the bulk of the dynamical mass in
his sample of galaxy clusters of T > 4 keV (see also Sect. 8.3 for gravitational lensing constraints).
This has the great advantage of naturally reproducing the proportionality of the electron density
in the cores of clusters to $T^{3/2}$, as observed [392]. However, looking at the central region of
low-temperature X-ray emitting galaxy groups, it was found [12] that the needed central density
of missing mass far exceeded this limit by a factor of several hundred. One would need one
neutrino species with $m_\nu \sim 10$ eV to reach the required densities. One exotic possibility is then the
idea of right-handed eV-scale sterile neutrinos T^: as strange as this sounds, this mass for sterile 
neutrinos could also provide a good fit to the CMB acoustic peaks (see Sect. 9.2). This may indeed
sound like the strangest and most complicated Universe possible, combining true non-baryonic (hot)
dark matter with a modification of gravity, but if this is what it takes to simultaneously explain
the Kepler-like laws of galactic dynamics and the extragalactic evidence for dark matter, it is
useful to remember that there are a priori good reasons for there being more particles than
those of the standard model of particle physics, and that there is a priori no reason why General
Relativity should be valid over a wide range of scales where it has never been tested. In any case,
experiments that can address the existence of such a ~10 eV-scale sterile neutrino would be
very interesting, as this kind of particle could provide the dark matter candidate only in a modified
gravity framework, since such a hot dark matter particle would be unable to form small structures
and to provide the dark matter that would be needed in galaxies.
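
The two numerical statements in this discussion can be checked with a few lines. The sketch below uses the coefficient of Eq. (66), $\Omega_\nu = 0.062\,m_\nu$, and assumes the standard scaling of the Tremaine-Gunn maximum density with the fourth power of the neutrino mass at fixed temperature; that scaling is an assumption of this sketch, as are the function names.

```python
"""Neutrino density parameter and Tremaine-Gunn mass scaling (illustrative sketch)."""

OMEGA_NU_PER_EV = 0.062   # coefficient of Eq. (66): Omega_nu per eV of single-type mass
TG_MASS_EXPONENT = 4.0    # assumed: maximum density scales as m_nu^4 at fixed temperature


def omega_nu(m_nu_ev):
    """Cosmological density of three equal-mass neutrino types, following Eq. (66)."""
    return OMEGA_NU_PER_EV * m_nu_ev


def density_boost(m_nu_ev, m_ref_ev):
    """Factor by which the maximum density rises when the mass goes from m_ref to m_nu."""
    return (m_nu_ev / m_ref_ev) ** TG_MASS_EXPONENT


def mass_for_density_boost(m_ref_ev, boost):
    """Neutrino mass needed to raise the maximum density by a given factor."""
    return m_ref_ev * boost ** (1.0 / TG_MASS_EXPONENT)


if __name__ == "__main__":
    print("Omega_nu for 2 eV neutrinos:", omega_nu(2.0))
    print("density boost from 2 eV to 10 eV:", density_boost(10.0, 2.0))
    print("mass needed for a boost of 625:", mass_for_density_boost(2.0, 625.0), "eV")
```

Under the assumed $m_\nu^4$ scaling, going from 2 eV to 10 eV raises the maximum density by a factor of $5^4 = 625$, consistent with the "several hundred" shortfall found for galaxy groups.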

Yet another possibility (iv) would be that MOND is incomplete, and that a new scale should
be introduced, in order to effectively enhance the value of $a_0$ in galaxy clusters, while lowering it
to its preferred value in galaxies. There are several ways to implement such an idea. For instance,
Bekenstein [37] proposed adding a second scale in order to allow for effective variations of the
acceleration constant as a function of the deepness of the potential (Eq. [?f)) . This idea should be 
investigated more in the future, but it is not clear that such a simple rescaling of $a_0$ would account
for the exact spatial distribution of the residual missing mass in MOND clusters, especially in
cases where it is displaced from the baryonic distribution (see Sect. 8.3). However, as even Gauss'
theorem would not be valid anymore in spherical symmetry, the high non-linearity might provide
non-intuitive results, and it would thus clearly be worth investigating this suggestion in more detail,
as well as developing similar ideas with other additional scales in the future (such as, for instance,
the baryonic matter density, see [83, 144] and Sect. 7.6).




Finally, as we shall see in the next section (Sect. 7), parent relativistic theories of MOND
often require additional degrees of freedom in the form of "dark fields", which can nevertheless
be globally subdominant to the baryon density, and thus do not necessarily act precisely as true
"dark matter". The last possibility (v) is thus that these fields, obviously not included in Milgrom's
formula, are responsible for the cluster missing mass in MOND. Examples of such fields are the
vector fields of TeVeS (Sect. 7.4) and Generalized Einstein-Aether theories (Sect. 7.7). It has
been shown (see Sect. 9.2) that the growth of the spatial part of the vector perturbation in the
course of cosmological evolution can successfully seed the growth of baryonic structures, just as
dark matter does. If these seeds persist, it was shown [113] that they could behave in very much
the same way as a dark matter halo in relatively unrelaxed galaxy clusters. However, it remains
to be seen whether the spatially concentrated distribution of missing mass in MOND would be
naturally reproduced in all clusters. In other relativistic versions of MOND (see, e.g., Sect. 7.6 and
Sect. 7.9), the "dark fields" are truly massive and can be thought of as true dark matter (although
more complex than simple collisionless DM), whose energy density outweighs the baryonic one,
and could provide the missing mass in clusters. However, again, it is not obvious that the centrally
concentrated distribution of residual missing mass in clusters would be naturally reproduced. All
in all, there is no obviously satisfactory explanation for the problem of residual missing mass in
the center of galaxy clusters, which remains one of the most serious problems facing MOND.
7 Relativistic MOND Theories



In Sect. 6, we have considered the classical theories of MOND and their predictions in a vast 
number of astrophysical systems. However, as already stated at the beginning of Sect. 6, these 
classical theories are only toy-models until they become the weak-field limit of a relativistic theory 
(with invariant physical laws under differentiable coordinate transformations), i.e., an extension 
of General Relativity (GR) rather than an extension of Newtonian dynamics. Here, we list the 
various existing relativistic theories boiling down to MOND in the quasi-static weak-field limit. It 
is useful to restate here that the motivation for developing such theories is not to get rid of dark 
matter but to explain the Kepler-like laws of galactic dynamics predicted by Milgrom's law (see 
Sect. 5). As we shall see, many of these theories include new fields, so that dark matter is often 
effectively replaced by "dark fields" (although, contrary to dark matter, their energy density can be 
subdominant to the baryonic one; note that, even more importantly, in a static configuration these 
dark fields are fully determined by the baryons, contrary to the traditional dark matter particles 
which may in principle be present independently of baryons). 

These theories are great advances because they enable us to calculate the effects of gravitational 
lensing and the cosmological evolution of the Universe in MOND, which are beyond the capabilities 
of classical theories. However, as we shall see, many of these relativistic theories still have their 
limitations, ranging from true theoretical or observational problems to more aesthetic problems
such as the arbitrary introduction of an interpolating function (Sect. 6.2) or the absence of an
understanding of the $\Lambda \sim a_0^2$ coincidence. What is more, the new fields introduced in these
theories have no counterpart yet in microphysics, meaning that these theories are, at best, only 
effective. So, despite the existing effective relativistic theories presented here, the quest for a more 
profound relativistic formulation of MOND continues. Excellent reviews of existing theories can 
also be found in, e.g., [551 [Ml 15^ [TUT] [Wl [TMl imi 11501 HH^ . 

The heart of GR is the equivalence principle(s), in its weak (WEP), Einstein (EEP) and strong 
(SEP) form. The WEP states the universality of free fall, while the EEP states that one recovers 
special relativity in the freely falling frame of the WEP. These equivalence principles are obtained 
by assuming that all known matter fields are universally and minimally coupled to one single 
metric tensor, the physical metric. It is perfectly fine to keep these principles in MOND, although 
certain versions can involve another type of (dark) matter not following the same geodesics as the
known matter, and thus effectively violating the WEP. Additionally, note that the local Lorentz 
invariance of special relativity could be spontaneously violated in MOND theories. The SEP, on 
the other hand, states that all laws of physics, including gravitation itself, are fully independent 
of velocity and location in spacetime. This is obtained in GR by making the physical metric itself 
obey the Einstein-Hilbert action. This principle has to be broken in MOND (see also Sect. 6.3). 
We now recall below how GR connects with Newtonian dynamics in the weak-field limit, which 
is actually the regime in which the modification must be set in order to account for the MOND 
phenomenology of the ultra- weak field limit. The action of GR writes as the sum of the matter 
action and the Einstein-Hilbert (gravitational) actiorF^: 

S'gr = 5'mattcr [matter, 5^^] -I- —— / d'^xy/^R, (69) 

iDTTLr J 

where g denotes the determinant of the metric tensor with (—,+,+,+) signatureF^. and R — 
RiJ.vg^'^ is its scalar curvature, Rp,i, being the Ricci tensor (involving second derivatives of the 
metric). The matter action is a functional of the matter fields, depending on them and their first 



If the action has the units of $\hbar$, the factor in front of the gravitational action is rather $c^3/16\pi G$. And if one
wishes to include a cosmological constant $\Lambda$, the integral then rather reads $\int d^4x\,\sqrt{-g}\,(R - 2\Lambda)$.
With this signature, the proper time is defined by $d\tau^2 = -g_{\mu\nu}\,dx^\mu dx^\nu$.
derivatives. For instance, the matter action of a free point particle, $S_{\rm pp}$, reads:

$$S_{\rm pp} \equiv -\int mc\,ds = -\int mc\,\sqrt{-g_{\mu\nu}(x)\,v^\mu v^\nu}\,dt, \qquad (70)$$

depending on the positions $x$ and on their time-derivatives $v^\mu$. Varying the matter action w.r.t.
matter fields degrees of freedom yields the equations of motion, i.e., the geodesic equation in the 
case of a point particle: 

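$$\frac{d^2x^\mu}{d\tau^2} = -\Gamma^\mu_{\alpha\beta}\,\frac{dx^\alpha}{d\tau}\frac{dx^\beta}{d\tau}, \qquad (71)$$

where the proper time $\tau = s$ is approximately equal to $ct$ for slowly moving non-relativistic particles,
and $\Gamma^\mu_{\alpha\beta}$ is the Christoffel symbol involving first derivatives of the metric. On the other hand,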
varying the total action w.r.t. the metric yields Einstein field equations: 

$$R_{\mu\nu} - \frac{1}{2}R\,g_{\mu\nu} = \frac{8\pi G}{c^4}\,T_{\mu\nu}, \qquad (72)$$

where $T_{\mu\nu}$ is the stress-energy tensor defined as the variation of the Lagrangian density of the
matter fields over the metric. 

In the static weak-field limit, the metric reads (up to third-order corrections in $1/c$):

$$g_{0i} = g_{i0} = 0, \qquad g_{00} = -\left(1 + \frac{2\Phi}{c^2}\right), \qquad g_{ij} = \left(1 + \frac{2\Psi}{c^2}\right)\delta_{ij}, \qquad (73)$$

where, in GR,

$$\Phi = \Phi_N \quad {\rm and} \quad \Psi = -\Phi_N, \qquad (74)$$

and $\Phi_N$ is the Newtonian gravitational potential. From the (0,0) component of the weak-field
metric, one gets back Newton's second law for massive particles, $d^2x^i/dt^2 = -c^2\Gamma^i_{00} = -\partial\Phi_N/\partial x^i$,
from the geodesic equation (Eq. 71). On the other hand, Einstein's equations (Eq. 72) give back the
Newtonian Poisson equation $\nabla^2\Phi_N = 4\pi G\rho$. The metric thus plays the role of the gravitational
potential, and the Christoffel symbol plays the role of acceleration. Note, however, that while time-like
geodesics are determined by the (0,0) component of the metric, this is not the case for null
geodesics. While the gravitational redshift for light rays is solely governed by the $g_{00}$ component
of the metric too, the deflection of light is also governed by the $g_{ij}$ components
(more specifically by $\Phi - \Psi$ in the weak-field limit). This means that, in order for the anomalous
effects of any modified gravity theory on lensing and on dynamics to correspond to a
similar amount of "missing mass" in GR, it is crucial that $\Psi \simeq -\Phi$ in Eq. 73 for such a theory.
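
As a rough numerical illustration of why the combination $\Phi - \Psi$ matters for lensing, the following Python sketch integrates the transverse gradient of $\Phi - \Psi$ along an unperturbed light ray passing a point mass. The function name, the parameter `psi_over_minus_phi` (a hypothetical ratio $\Psi/(-\Phi)$), and the numerical constants are assumptions introduced here for illustration; they do not come from the text.

```python
import math

G = 6.674e-11       # gravitational constant [m^3 kg^-1 s^-2]
C_LIGHT = 2.998e8   # speed of light [m/s]


def deflection_angle(mass, impact_parameter, psi_over_minus_phi=1.0, steps=20000):
    """Weak-field deflection angle (radians) of a light ray passing a point mass.

    The bending is sourced by the transverse gradient of (Phi - Psi), integrated
    along the unperturbed ray. psi_over_minus_phi is the assumed ratio Psi/(-Phi):
    1.0 is the GR case of Eq. 74, 0.0 switches off the g_ij contribution.
    """
    b = impact_parameter
    # Substituting z = b*tan(theta) maps the infinite line of sight onto
    # [-pi/2, pi/2]: integral of b/(b^2+z^2)^(3/2) dz = (1/b) * integral of cos(theta).
    d_theta = math.pi / steps
    integral = 0.0
    for i in range(steps):
        theta = -math.pi / 2 + (i + 0.5) * d_theta   # midpoint rule
        integral += math.cos(theta) * d_theta
    grad_phi_along_ray = G * mass * integral / b      # integral of dPhi/db along the ray
    return (1.0 + psi_over_minus_phi) * grad_phi_along_ray / C_LIGHT**2
```

With $\Psi = -\Phi$ the sketch recovers the familiar GR value $4GM/(c^2 b)$; setting the ratio to zero halves the deflection, which is the kind of lensing deficit that a theory with $\Psi \neq -\Phi$ has to worry about.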



7.1 Scalar-tensor k-essence 

MOND is an acceleration-based modification of gravity in the ultra-weak-field limit, but since the
Christoffel symbol, playing the role of acceleration in GR, is not a tensor, it is in principle not
possible to make a general relativistic theory depend on it. Another natural way to account for
the departure from Newtonian gravity in the weak-field limit and to account for the violation of
the SEP inherent to the external field effect is to resort to a scalar-tensor theory, as first proposed
by [39]. The added scalar field can play the role of an auxiliary potential, and its gradient then

Note that, at 1PN, this weak-field metric can also be written as $g_{00} = -e^{2\Phi/c^2}$, $g_{ij} = e^{2\Psi/c^2}\delta_{ij}$. Note also
that Taylor expanding Eq. 70 yields $S_{\rm pp} = \int m\,(v^2/2 - \Phi_N - c^2)\,dt$, so that the sum of the classical kinetic and
internal actions for a point particle (see Eq. HSU are now lumped together into the matter action. 

The derived lensing and dynamical masses are typically very close to each other but the data are not yet precise 
enough to ascertain that they are exactly identical. 
has the dimensions of acceleration and can be used to enforce the acceleration-based modification
of MOND.

The relativistic theory of [39] depends on two fields, an "Einstein metric" $\tilde g_{\mu\nu}$ and a scalar field
$\phi$. The physical metric $g_{\mu\nu}$ entering the matter action is then given by a conformal transformation
of the Einstein metric through an exponential coupling function:

$$g_{\mu\nu} \equiv e^{2\phi}\,\tilde g_{\mu\nu}. \qquad (75)$$

In order to recover the MOND dynamics, the Einstein-Hilbert action (involving the Einstein
metric) remains unchanged ($\int d^4x\,\sqrt{-\tilde g}\,\tilde R$), and the dimensionless scalar field is given a so-called
k-essence action, with no potential and a non-linear, aquadratic, kinetic term inspired by the
AQUAL action of Eq. [HJ 

$$S_\phi \equiv -\frac{c^4}{2k^2l^2G}\int d^4x\,\sqrt{-\tilde g}\,f(X), \qquad (76)$$

where $k$ is a dimensionless constant, $l$ is a length-scale, $X = kl^2\,\tilde g^{\mu\nu}\phi_{,\mu}\phi_{,\nu}$, and $f(X)$ is the "MOND
function". Since the action of the scalar field is similar to that of the potential in the Bekenstein-
Milgrom version of classical MOND, this relativistic version is known as the Relativistic Aquadratic
Lagrangian theory, RAQUAL.

Varying the action w.r.t. the scalar field yields, in a static configuration, the following modified 
Poisson's equation for the scalar field: 

$$c^2\,\nabla\cdot\left[\nabla\phi\,f'(kl^2|\nabla\phi|^2)\right] = kG\rho, \qquad (77)$$

and the (0,0) component of the physical metric is given by $g_{00} = -e^{2(\Phi_N + c^2\phi)/c^2}$, leading us
precisely to the situation of Eq. 40 in the weak-field limit, with $\Phi = \Phi_N + c^2\phi$, with

$$s \equiv (c^2/a_0)\,|\nabla\phi| = \left(Xc^4/kl^2a_0^2\right)^{1/2} \qquad (78)$$

and

$$\tilde\mu(s) \equiv (4\pi c^2/k)\,f'(X), \qquad (79)$$

whose finely tuned relation with the $\mu$-function of Milgrom's law is extensively described in
Sect. 6.2. We note that the standard choice for $X \ll 1$ is $f'(X) \sim (X/3)^{1/2}$, meaning that in
order to recover $\tilde\mu(s) = s/\xi$ for small $s$, where $\xi = G_N/G$ (see Sect. 6.2), one must define the
length-scale as

$$l \equiv (c^2\sqrt{3k})/(4\pi\xi a_0). \qquad (80)$$
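
As a consistency check of the length-scale definition, the following Python sketch solves the spherically symmetric form of Eq. 77 around a point mass with the deep-MOND choice $f'(X) = (X/3)^{1/2}$, and evaluates the scalar contribution to the acceleration, $c^2|\nabla\phi|$. The function names, the numerical value adopted for $a_0$, and the sample mass and radius are assumptions made here for illustration only.

```python
import math

G = 6.674e-11       # gravitational constant [m^3 kg^-1 s^-2]
C_LIGHT = 2.998e8   # speed of light [m/s]
A0 = 1.2e-10        # assumed value of Milgrom's constant [m s^-2]


def raqual_length_scale(k, xi=1.0, a0=A0):
    """Length-scale of Eq. 80: l = c^2 sqrt(3k) / (4 pi xi a0)."""
    return C_LIGHT**2 * math.sqrt(3.0 * k) / (4.0 * math.pi * xi * a0)


def deep_mond_scalar_acceleration(mass, r, k, xi=1.0, a0=A0):
    """Return c^2 |grad phi| at radius r from Eq. 77 with f'(X) = sqrt(X/3).

    In spherical symmetry Eq. 77 integrates (Gauss theorem) to
    c^2 |grad phi| f'(X) = k G M / (4 pi r^2), with X = k l^2 |grad phi|^2,
    which is a quadratic equation for |grad phi| with this choice of f'.
    """
    l = raqual_length_scale(k, xi, a0)
    grad_phi_sq = k * G * mass / (
        4.0 * math.pi * r**2 * C_LIGHT**2 * l * math.sqrt(k / 3.0)
    )
    return C_LIGHT**2 * math.sqrt(grad_phi_sq)
```

Under these assumptions the result reduces to $\sqrt{\xi G M a_0}/r$ independently of $k$, i.e. the scalar field supplies the $1/r$ deep-MOND force that underlies flat rotation curves. The sketch is only meaningful where $X \ll 1$.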

It was immediately realized [39] that a k-essence theory such as RAQUAL can exhibit superluminal
propagations whenever $f''(X) > 0$ [81]. Although it does not threaten causality [81],
one has to check that the Cauchy problem is still well-posed for the field equations. It has been
shown [81, 361] that it requires the otherwise free function $f$ to satisfy the following properties,
$\forall X$:

$$f'(X) > 0, \qquad (81)$$
$$f'(X) + 2Xf''(X) > 0, \qquad (82)$$

which is the equivalent of the constraints of Eq. 37 on Milgrom's $\mu$-function.
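
The two conditions of Eqs. 81 and 82 are easy to test numerically for a candidate function. The Python sketch below does so for the deep-MOND form $f'(X) = (X/3)^{1/2}$ quoted earlier; the helper names and the finite-difference step are illustrative assumptions, and the check is a spot test at chosen values of $X$, not a proof.

```python
def deep_mond_f_prime(x):
    """Deep-MOND kinetic function derivative, f'(X) = (X/3)^(1/2), for X > 0."""
    return (x / 3.0) ** 0.5


def second_derivative(f_prime, x, rel_step=1e-6):
    """Central finite-difference estimate of f''(X) from a callable f'(X)."""
    step = rel_step * x
    return (f_prime(x + step) - f_prime(x - step)) / (2.0 * step)


def satisfies_cauchy_conditions(f_prime, x):
    """Check Eq. 81 (f' > 0) and Eq. 82 (f' + 2 X f'' > 0) at a given X > 0."""
    fp = f_prime(x)
    fpp = second_derivative(f_prime, x)
    return fp > 0.0 and fp + 2.0 * x * fpp > 0.0


def is_superluminal(f_prime, x):
    """Superluminal propagation criterion quoted in the text: f''(X) > 0."""
    return second_derivative(f_prime, x) > 0.0
```

For this particular choice both conditions hold for every $X > 0$ (the second combination equals $2f'$), while $f'' > 0$, so the well-posed Cauchy problem coexists with the superluminal propagation mentioned above.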



The frame associated to the Einstein metric is called the "Einstein frame" as opposed to the "matter frame" 
or "Jordan frame" , associated to the physical metric. 

k-essence fields have also recently been reintroduced as possible dark energy fluids, which could also drive
inflation [20, 21, 93]. This name comes from the fact that their dynamics is dominated by their kinetic term $f(X)$
(in the case of RAQUAL, there is no potential at all), contrary to other dark energy models such as quintessence, 
in which the scalar field potential plays the crucial role. 
However, another problem was immediately realized at an observational level [39, 41]. Because
of the conformal transformation of Eq. 75, one has that $\Psi \neq -\Phi$ in the RAQUAL equivalent of
Eq. 73. In other words, as it is well known that gravitational lensing is insensitive to conformal
rescalings of the metric, apart from the contribution of the stress-energy of the scalar field to
the source of the Einstein metric [41, 82], the "non-Newtonian" effects of the theory respectively
on lensing and dynamics do not at all correspond to similar amounts of "missing mass" . This is 
also considered a generic problem with any local pure metric formulation of MOND [442] . 



7.2 Stratified theory 

A solution to the above gravitational lensing problem due to the conformal rescaling of the metric
in RAQUAL has been presented in [385]. Inspired by "stratified" theories of gravity [335],
Sanders [385] suggested, in addition to the scalar field $\phi$ of RAQUAL, the use of a non-dynamical
timelike vector field $U_\mu = (-1,0,0,0)$ with unit norm $\tilde g^{\mu\nu}U_\mu U_\nu = -1$ (in terms of the Einstein metric),
in order to enforce a disformal relation between the Einstein and physical metrics:

$$g_{\mu\nu} \equiv e^{-2\phi}\,\tilde g_{\mu\nu} - 2\sinh(2\phi)\,U_\mu U_\nu. \qquad (83)$$

The second term only affects the $g_{00}$ component, and it then appears immediately that $\Psi = -\Phi$
in the weak-field limit (rhs terms of Eq. 73), and the problem of lensing is cured. However, the
prescription that a 4-vector points in the time direction is not a covariant one, and the theory 
should involve strong preferred frames effects, although these can now be fully suppressed, as well 
as any deviation from GR at small distances, with an appropriate additional "Galileon" term in 
addition to the asymptotic deep-MOND k-essence term in the action of the scalar field [22] (the
other advantage being that the interpolating function then does not have to be inserted by hand). 
In any case, endowing the vector field with covariant dynamics of its own has thus been the next 
logical step in developing relativistic MOND theories. 
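
The difference between the conformal and the disformal couplings can be made concrete with a few lines of Python. The sketch below expands both physical metrics to first order around a Newtonian Einstein-frame metric and returns the two weak-field potentials; the function names and the neglect of the scalar field's own stress-energy are assumptions of this illustration.

```python
C_LIGHT = 2.998e8  # speed of light [m/s]


def raqual_potentials(phi_newton, scalar, c=C_LIGHT):
    """Weak-field (Phi, Psi) for the conformal coupling g = exp(2 phi) g_Einstein.

    Both g_00 and g_ij are rescaled by the same factor, so the scalar shifts
    Phi and Psi by the same amount +c^2 * scalar.
    """
    return phi_newton + c**2 * scalar, -phi_newton + c**2 * scalar


def stratified_potentials(phi_newton, scalar, c=C_LIGHT):
    """Weak-field (Phi, Psi) for the disformal coupling with a unit timelike vector.

    g_ij carries exp(-2 phi) while g_00 effectively carries exp(+2 phi),
    so the scalar enters Phi and Psi with opposite signs and Psi = -Phi.
    """
    total = phi_newton + c**2 * scalar
    return total, -total


def lensing_potential(big_phi, big_psi):
    """Combination governing light deflection in the weak-field limit."""
    return big_phi - big_psi
```

In the conformal case the lensing combination is $2\Phi_N$, blind to the scalar field, whereas in the disformal case it is $2(\Phi_N + c^2\phi)$, so that lensing and dynamics probe the same total potential.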



7.3 Original Tensor-Vector-Scalar theory



The idea of the Tensor-Vector-Scalar theory of Bekenstein [34], dubbed TeVeS, is to keep the
disformal relation of Eq. 83 between the Einstein metric $\tilde g_{\mu\nu}$ and the physical metric $g_{\mu\nu}$ to which
matter fields couple, but to replace the above non-dynamical vector field by a dynamical vector
field with an action ($K$ being a dimensionless constant):

$$S_U \equiv -\frac{c^4}{16\pi G}\int d^4x\,\sqrt{-\tilde g}\left[\frac{K}{2}\,\tilde g^{\alpha\beta}\tilde g^{\mu\nu}\,U_{[\alpha,\mu]}U_{[\beta,\nu]} - \lambda\left(\tilde g^{\mu\nu}U_\mu U_\nu + 1\right)\right], \qquad (84)$$

akin to that of the electromagnetic 4-potential vector field ($U_{[\mu,\nu]}$ playing the role of the Faraday
tensor), but without the coupling term to the 4-current, and with a constraint term forcing the
unit norm $U_\mu U^\mu = \tilde g^{\mu\nu}U_\mu U_\nu = -1$ ($\lambda$ being a Lagrange multiplier function, to be determined as
the equations are solved). The first term in the integrand takes care of approximately aligning
$U_\mu$ with the 4-velocity of matter (when simultaneously solving for (i) the Einstein-like equation of
the Einstein metric $\tilde g_{\mu\nu}$ and for (ii) the vector equation obtained by varying the total action with
respect to $U_\mu$).

Finally, the k-essence action for the scalar field is kept as in RAQUAL (Eq. 76), but with

$$X_{\rm TeVeS} = kl^2\left(\tilde g^{\mu\nu} - U^\mu U^\nu\right)\phi_{,\mu}\phi_{,\nu}. \qquad (85)$$

Contrary to RAQUAL, this scalar field exhibits no superluminal propagation modes. However, [55]
noted that such superluminal propagation might have to be re-introduced in order to avoid excessive
Cherenkov radiation and suppression of high-energy cosmic rays (see also [320]).
The static weak-field limit equation for the scalar field is precisely the same as Eq. 77, and the
scalar field enters the static weak-field metric of Eq. 73 as $\Phi = -\Psi = \Xi\Phi_N + c^2\phi$, meaning that lensing
and dynamics are compatible, with $\Xi$ being a factor depending on $K$ and on the cosmological value
of the scalar field (see Eq. 58 of [34]). This can be normalized to yield $\Xi = 1$ at redshift zero.
Again, all the relations between the free function $f$ and Milgrom's $\mu$-function can be found in
Sect. 6.2 (see also [146, 432]).

This theory has played a true historical role as a proof of concept that it was possible to construct 
a fully relativistic theory both enhancing dynamics and lensing in a coherent way and reproduc- 
ing the MOND phenomenology for static configurations with the dynamical 4-vector pointing in 
the time direction. However, the question remained whether these static configurations would be 
stable. What is more, although a classical Hamiltonian unbounded from below in flat spacetime
would not necessarily be a concern at the classical level (and even less if the model is only
"phenomenological"), it would inevitably become a worry for the existence of a stable quantum vacuum
(see however [197]). And indeed, it was shown in [99] that models with such "Maxwellian" vector
fields having a TeVeS-like Lagrange multiplier constraint in their action have a corresponding
Hamiltonian density that can be made arbitrarily large and negative (see also Sect. IV.A of [82]).
What is more, even at the classical level, it has been shown that spherically symmetric solutions
of TeVeS are heavily unstable [413, 414], and that this type of vector field causes caustic singularities
[106], in the sense that the integral curves of the vector are timelike geodesics meeting each
other when falling into gravity potential wells. Another form was thus needed for the action of the
TeVeS vector field. 

7.4 Generalized Tensor-Vector-Scalar theory

The generalization of TeVeS was proposed by Skordis [429]. Inspired by the fact that Einstein-Aether
theories [207, 208] also present instabilities when the unit-norm vector field is "Maxwellian"
as above, it was simply proposed to use a more general Lagrangian density for the vector field,
akin to that of Einstein-Aether theories:

$$S_U \equiv -\frac{c^4}{16\pi G}\int d^4x\,\sqrt{-\tilde g}\left[K^{\alpha\beta\mu\nu}\,U_{\beta,\alpha}U_{\nu,\mu} - \lambda\left(\tilde g^{\mu\nu}U_\mu U_\nu + 1\right)\right], \qquad (86)$$

where

$$K^{\alpha\beta\mu\nu} = c_1\,\tilde g^{\alpha\mu}\tilde g^{\beta\nu} + c_2\,\tilde g^{\alpha\beta}\tilde g^{\mu\nu} + c_3\,\tilde g^{\alpha\nu}\tilde g^{\beta\mu} + c_4\,U^\alpha U^\mu\,\tilde g^{\beta\nu} \qquad (87)$$

for a set of constants $c_1, c_2, c_3, c_4$. Interestingly, spherically symmetric solutions depend only on the
combination $c_1 - c_4$, not on $c_2$ and $c_3$, which can in principle be chosen to avoid the instabilities of the
original TeVeS theory. The original unstable theory is of course also included in this generalization
through a specific combination of the four $c_i$ (see, e.g., [432]).

This generalized version is thus the current "working version" of what is now called TeVeS: a
tensor-vector-scalar theory with an Einstein-like metric, an Einstein-Aether-like unit-norm vector
field, and a k-essence-like scalar field, all related to the physical metric through Eq. 83. It has
been extensively studied, both in its original and generalized form. It has for instance been shown 
that, contrary to many gravity theories with a scalar sector, the theory evidences no cosmological 
evolution of the Newtonian gravitational constant G and only minor evolution of Milgrom's con- 
stant ao [1461 HO] . The fact that the latter is still put in by hand through the length-scale of the 
theory $l \sim c^2/a_0$, and has no dynamical connection with the Hubble or cosmological constant is
however perhaps a serious conceptual shortcoming, together with the free function put by hand in 
the action of the scalar field (but see [22] for a possible solution to the latter shortcoming). The 
relations between this free function and Milgrom's $\mu$ can be found in [146, 432] (see also Sect. 6.2),

Expressed in terms of $U_\mu$ and its conjugate momenta $P^\mu \equiv \partial L/\partial\dot U_\mu$.
the detailed structure of null and timelike geodesics of the theory in [432], the analysis of the
parametrized post-Newtonian coefficients (including the preferred-frame parameters quantifying
the local breaking of Lorentz invariance) in [174, 373, 391, 451], solutions for black holes and neutron
stars in [2451 [MS [2481 [Mil ME SMI EM >, and gravitational waves in [2171 IHSl [2161 MM. It
is important to remember that TeVeS is not equivalent to GR in the strong regime, which is why
it can be tested there, e.g., with binary pulsars or with the atomic spectral lines from the surface
of stars [123], or other very strong-field effects. However, these effects can always generically be
suppressed (at the price of introducing a Galileon-type term in the action [22]), and such tests
would never test MOND as a paradigm. It is by testing gravity in the weak-field regime that
MOND can really be put to the test.

Finally, let us note that TeVeS (and its generalization) has been shown to be expressible (in
the "matter frame") only in terms of the physical metric $g_{\mu\nu}$ and the vector field $U_\mu$ [514], the
scalar field being eliminated from the equations through the "unit-norm" constraint in terms of the
Einstein metric, $\tilde g^{\mu\nu}U_\mu U_\nu = -1$, leading to $g^{\mu\nu}U_\mu U_\nu = -e^{-2\phi}$. In this form, TeVeS is sometimes
thought of as GR with an additional "dark fluid" described by a vector field [504].
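
The way the unit-norm constraint trades the scalar field for the norm of the vector in the matter frame can be checked directly from the disformal relation of Eq. 83. The Python sketch below does this for the simplest possible background, a flat Einstein metric with $U_\mu = (-1,0,0,0)$; this background and the function names are assumptions chosen for illustration.

```python
import math


def physical_metric_diagonal(scalar):
    """Diagonal of the physical metric from the disformal relation of Eq. 83.

    Assumes a flat Einstein metric diag(-1, 1, 1, 1) and U_mu = (-1, 0, 0, 0),
    so that the U_mu U_nu term only touches the 00 component.
    """
    einstein = [-1.0, 1.0, 1.0, 1.0]
    u_lower = [-1.0, 0.0, 0.0, 0.0]
    return [
        math.exp(-2.0 * scalar) * einstein[i]
        - 2.0 * math.sinh(2.0 * scalar) * u_lower[i] ** 2
        for i in range(4)
    ]


def vector_norm_in_matter_frame(scalar):
    """Contract U_mu U_nu with the inverse physical metric (diagonal case)."""
    u_lower = [-1.0, 0.0, 0.0, 0.0]
    inverse = [1.0 / g for g in physical_metric_diagonal(scalar)]
    return sum(inverse[i] * u_lower[i] ** 2 for i in range(4))
```

In this setting $g_{00} = -e^{2\phi}$ and $g_{ij} = e^{-2\phi}\delta_{ij}$, and the norm of the vector with respect to the physical metric comes out as $-e^{-2\phi}$, so the value of the scalar field can be read off from the vector alone.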



7.5 Bi-Scalar-Tensor-Vector theory

In TeVeS [34], the "MOND function" $f(X_{\rm TeVeS})$ of Eq. 76, where $X_{\rm TeVeS} = kl^2(\tilde g^{\mu\nu} - U^\mu U^\nu)\phi_{,\mu}\phi_{,\nu}$,
could also be expressed as a potential $V$ of a non-dynamical scalar field $q$, i.e., a scalar action for
TeVeS of the form:

$$S_\phi \propto -\int d^4x\,\sqrt{-\tilde g}\left[\frac{1}{2}\,q^2 X_{\rm TeVeS} + V(q)\right]. \qquad (88)$$

After variation of the action w.r.t. this non-dynamical field, one gets $qX_{\rm TeVeS} = -V'(q)$, and variation
w.r.t. $\phi$ yields the usual BM Poisson equation for $\phi$ (Eq. [T7]), with $q^2 \propto \mu(\sqrt{X_{\rm TeVeS}})$. Inspired
by an older theory (Phase Coupling Gravity [33, 382]) devised in a partially successful attempt
to eliminate superluminal propagation from RAQUAL (but plagued with the same gravitational
lensing problem as RAQUAL, and with additional instabilities), Sanders [390] proposed to make
this field dynamical by adding a kinetic term $\tilde g^{\mu\nu}q_{,\mu}q_{,\nu}$ in the action, leading to the following very
general action for the scalar fields $\phi$ and $q$:

$$S_{\phi,q} \propto -\int d^4x\,\sqrt{-\tilde g}\left[\frac{1}{2}\left(\tilde g^{\mu\nu}q_{,\mu}q_{,\nu} + H(q)\left(\tilde g^{\mu\nu} + U^\mu U^\nu\right)\phi_{,\mu}\phi_{,\nu}\right) - F(q)\,U^\mu U^\nu\phi_{,\mu}\phi_{,\nu} + V(q)\right]. \qquad (89)$$

In this theory (dubbed BSTV for bi-scalar-tensor-vector theory), the physical metric has the same
a priori form as in TeVeS, meaning that $\phi$ is the matter-coupling scalar field, while $q$ only influences
the strength of that coupling. A remarkable achievement of the theory is that the quasi-static field
equation for $\phi$ can be obtained only in a cosmological context, and thereby naturally explains the
connection between $a_0$ and $H_0$ [390]. What is more, oscillations of the $q$ field around its expectation
value can be considered as massive dark matter, allowing an explanation of the peaks of
the angular power spectrum of the Cosmic Microwave Background [390]. Unfortunately, various
instabilities and a Hamiltonian unbounded from below have been evidenced in Sect. IV.A of [82],
thus most likely ruling out this theory, at least in its present form.



7.6 Non-minimal scalar-tensor formalism 

As a consequence of the inability of RAQUAL (the scalar-tensor k-essence of Sect. 7.1) to enhance 
gravitational lensing, all other attempts reviewed so far (Sect. 7.2 to Sect. 7.5) have been plagued 

It is also important to remember that some interpolation functions (Sect. 6.2) are already excluded by Solar 
system tests, and it is thus useless to exclude these over and over again. 
with an aesthetically unpleasant growth of additional fields and free parameters. This has led
Bruneton & Esposito-Farese [82] to consider models with fewer additional fields. They first considered
pure metric theories in which matter is not only coupled to the metric but also non-minimally
to its curvature (Eqs. 5.1 and 5.2 of [82]). While they showed that such models can indeed reproduce
the MOND dynamics, they also concluded that they are generically unstable if locality is
to be preserved (but see Sect. 7.10). They then considered models in which at most one scalar
field is added, without any additional vector field, but where this field is coupled non-minimally
to matter, in the sense that the matter-coupling depends on the scalar field itself but also on its
first derivatives. In other words, the gradient of the scalar field is replacing the dynamical vector
field of TeVeS. The simple scalar field action is just the normal action of a massive scalar field:

$$S_\phi \equiv -\frac{c^4}{8\pi G l^2}\int d^4x\,\sqrt{-\tilde g}\,\left[X + 2V(\phi)\right], \qquad (90)$$

with $X = l^2\,\tilde g^{\mu\nu}\phi_{,\mu}\phi_{,\nu}$ and $V(\phi) = l^2m^2\phi^2/2$. The physical metric $g_{\mu\nu}$ is then disformally related
to the Einstein metric through (see Eq. 5.11 of [82]):

$$g_{\mu\nu} \equiv A^2\,\tilde g_{\mu\nu} + B\,\phi_{,\mu}\phi_{,\nu}, \qquad (91)$$

with the functionals

$$A(\phi, X) \equiv e^{\eta\phi} - \phi\,h(Y)\,Y/\eta, \qquad B(\phi, X) \equiv -4\,\phi\,\eta^{-1}\,Y/X, \qquad (92)$$

where $Y \equiv (\eta a_0 l)^{1/2}\,c^{-1}X^{-1/4}$. The free function $h(Y)$ is the "MOND function" playing the role of
Milgrom's $\mu$. An alternative formulation of the model is obtained by separating the matter action
into a normal matter action and an "interaction term" between the scalar field, the metric and the
matter fields [83]. Considering the massive scalar field as a dark matter fluid, this model can thus
be interpreted as non-standard baryon-dark matter interaction leading to the MOND behavior. If
the scalar mass $m$ is small enough, it is a pure MOND theory, but if it is higher, it can lead to
a "DM+MOND" behavior, especially noteworthy in regions of high gravity such as the center of
galaxy clusters (see Sect. 6.6 and discussions in [8?]). Let us note that, while this theory exhibits
superluminal propagations outside of matter, it is in principle not a problem for causality [81]. It
has also been possible to study the behavior of the theory within matter, e.g., within the dilute
HI gas inside galaxy disks (an analysis which is mostly too difficult to perform in other models
reviewed so far): this led to a deadly problem, i.e., that the Cauchy problem becomes ill-posed
and the solutions to field equations ill-defined. A possible solution was proposed in [83], namely
to make the matter coupling (or, equivalently, the baryon-scalar DM interaction) depend on the
local density of matter: this can also lead to an interesting phenomenology, where only gas-rich
systems behave according to Milgrom's law, while others would behave in a CDM way [144]. A lot
remains to be studied within this framework.

7.7 Generalized Einstein-Aether theories

All theories reviewed so far are best expressed in the "Einstein frame" , and involve an a priori 
form for the physical metric to which matter couples (an a priori form expressed as a function 
of the Einstein metric and of the other additional fields). However, the work of [514] has shown 
that, for instance, TeVeS (Sects. 7.3 and 7.4) is expressible as a pure Tensor-Vector theory in
the matter frame, and that the physical metric then both satisfies the Einstein-Hilbert action and 
couples minimally to the matter fields, just like in GR. In fact, the modification of gravity in 

A characteristic matter density $\rho_0$ thus becomes an additional order parameter, in the spirit of the velocity
scale so of [37]; see Eq. 3.1 of [83].
TeVeS thus only comes from the coupling of the physical metric to the vector field. The idea of
Zlosnik et al. [515] was then that a similar, but simpler, modification of gravity could be obtained
by devising a simple Tensor-Vector theory in the matter frame, with no a priori on the geometry
of the physical metric. Starting from the extensively studied Einstein-Aether theories [207, 208],
with a vector action of the type of Eq. 86, the idea is to make the k-essence free function $f(X)$ (the
"MOND function" of Eq. 76) act directly on the vector field rather than on an additional scalar
field. This thus leads to vector k-essence, or Generalized Einstein-Aether (GEA) theories (also
called non-canonical Einstein-Aether theories), in which the Einstein-Hilbert and matter actions
remain as in GR, but with an additional unit-norm vector field with the following action [432, 515]:

$$S_U \equiv -\frac{c^4}{16\pi G l^2}\int d^4x\,\sqrt{-g}\,\left[f(X_{\rm gea}) - l^2\lambda\left(g^{\mu\nu}U_\mu U_\nu + 1\right)\right], \qquad (93)$$

where (see Eq. 87, replacing $\tilde g^{\mu\nu}$ by $g^{\mu\nu}$)

$$X_{\rm gea} = l^2\,K^{\alpha\beta\mu\nu}\,U_{\beta,\alpha}U_{\nu,\mu}. \qquad (94)$$

The unit-norm constraint fixes the vector field in terms of the metric, and from there we have that,
in the weak-field limit, $X_{\rm gea} \propto -|\nabla\Phi|^2$, with $\Phi$ defined as in Eq. 73. The Einstein equation in
the weak-field limit then yields a BM type of Poisson equation (Eq. 17) for the full gravitational
potential $\Phi$, with $\mu = f' + (1 - f')/(1 - C/2)$ and $C = c_1 - c_4$ [432]. In the deep-MOND limit,
the usual choice for $f$ is of the type $f(X_{\rm gea}) \propto (-X_{\rm gea})^{3/2} + 2X_{\rm gea}/C$, and the length-scale must
be fixed as:

$$l \equiv \frac{2\,(2 - C)\,c^2}{3\,C^{3/2}\,a_0}. \qquad (95)$$

Let us note that this weak-field limit of GEA theories is different from that of RAQUAL or TeVeS,
where only the scalar field obeys a BM-like equation governed by an interpolating function $\tilde\mu(s)$,
and where the total potential is given by Eq. [301 
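
To see how a deep-MOND interpolating function can emerge from a vector k-essence function, the following Python sketch evaluates $\mu = f' + (1 - f')/(1 - C/2)$ for the choice $f(X) = (-X)^{3/2} + 2X/C$. Several ingredients are assumptions of this illustration and are not taken from the text: the unit coefficient in front of $(-X)^{3/2}$, the weak-field normalization $X_{\rm gea} = -C\,l^2|\nabla\Phi|^2/c^4$, the value of $a_0$, and the length scale, which is simply the one that makes $\mu$ reduce to $|\nabla\Phi|/a_0$ under these assumptions.

```python
import math

C_LIGHT = 2.998e8   # speed of light [m/s]
A0 = 1.2e-10        # assumed value of Milgrom's constant [m s^-2]


def gea_length_scale(C, a0=A0):
    """Length scale giving mu = g/a0 for the assumed f and X normalization (0 < C < 2)."""
    return 2.0 * (2.0 - C) * C_LIGHT**2 / (3.0 * C**1.5 * a0)


def gea_mu(g, C, a0=A0):
    """Interpolating function mu = f' + (1 - f')/(1 - C/2) at acceleration g = |grad Phi|."""
    l = gea_length_scale(C, a0)
    x = -C * (l * g / C_LIGHT**2) ** 2          # assumed weak-field value of X_gea (negative)
    f_prime = -1.5 * math.sqrt(-x) + 2.0 / C    # derivative of (-X)^(3/2) + 2X/C
    return f_prime + (1.0 - f_prime) / (1.0 - C / 2.0)
```

The constant term $2/C$ in $f'$ is what cancels the Newtonian part of $\mu$: at $f' = 2/C$ the expression vanishes identically, leaving only the piece linear in $|\nabla\Phi|$.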

The remarkable feature of GEA theories allowing for the desired enhancing of gravitational
lensing without any a priori on the form of the physical metric is that, writing the metric as in
Eq. 73, it can be shown [432] that in the limit $X_{\rm gea} \to 0$ the action of Eq. 93 is only a function of
$\Upsilon = \Phi + \Psi$ and is thus invariant under disformal transformations [$\Phi \to \Phi + \beta(r)$; $\Psi \to \Psi - \beta(r)$], of
the type of Eq. 83. These GEA theories are currently extensively studied, mostly in a cosmological
context (see Sect. 9), but also for their parametrized post-Newtonian coefficients in the Solar
system [66] or for black hole solutions [452].

Interestingly, it has been shown that these vector field theories (TeVeS, BSTV, GEA) are all
part of a broad class of theories studied in [184]. Yet other phenomenologically interesting theories
exist among this class, such as, for instance, the $V\Lambda$ models considered by Zhao & Li [503, 507, 511]
with a dynamical norm vector field, whose norm obeys a potential (giving it a mass) and has a
non-quadratic kinetic term à la RAQUAL, in order to try reproducing both the MOND phenomenology
and the accelerated expansion of the Universe, while interpreting the vector field as a fluid of
neutrinos with varying mass [505, 506]. This has the advantage of giving a microphysics meaning
to the vector field. Such vector fields have also been argued to arise naturally from dimensional
reduction of higher dimensional gravity theories [35, 262], or, more generally, to be necessary from
the fact that quantum gravity could need a preferred rest frame [207] in order to protect the theory
against instabilities when allowing for higher derivatives to make the theory renormalizable (e.g.,
in Hořava gravity [65, 196]). Inspired by this possible need of a preferred rest frame in quantum
gravity, relativistic MOND theories boiling down to particular cases of GEA theories in which the
vector field is hypersurface-orthogonal have, for instance, been proposed in [62, 396].
7.8 Bimetric theories



In the previous theories, the acceleration-dependence of MOND enters the equations through a
free "MOND function" $f(X)$ acting either on the contracted gradient of an added scalar field, with
dimensions of acceleration (Eq. 85), or on a scalar formed with the first derivatives of a vector field
(Eq. 94) with a unit-norm constraint relating it to the gradient of the potential in the physical
metric. The "MOND function" could not act directly on the Christoffel symbol because this is
not a tensor, and such a theory would thus violate general covariance. However, if there is more
than one metric entering gravitation, the difference between the associated Christoffel symbols is
a tensor, and one can construct from it a scalar with dimensions of acceleration, on which the
"MOND function" can act. Such theories in which there are two dynamical rank-2 symmetric
tensor fields are called bimetric theories [205, 206, 370]. Milgrom [313, 318] proposed to construct
a whole parametrized class of bimetric MOND theories (dubbed BIMOND), involving an auxiliary
metric, with various phenomenological behaviors in the weak-field limit, ranging from Bekenstein-
Milgrom MOND to QUMOND as well as a mix of both (see [321]). As one example (parameters
$\alpha = -\beta = -1$ in the general class of BIMOND theories, for which we refer the reader to the
review [321]), the auxiliary metric $\hat g_{\mu\nu}$ can, e.g., be introduced precisely in the same way as the
auxiliary potential $ph in the QUMOND classical action of Eq. [M] : 

$$S \equiv S_{\rm matter}[{\rm matter}, g_{\mu\nu}] + S_{\rm matter}[{\rm twin\ matter}, \hat g_{\mu\nu}] + \frac{c^4}{16\pi G}\int d^4x\,\sqrt{-g}\,\left[R - \hat R - 2\,l^{-2}f(X_{\rm bimond})\right], \qquad (96)$$

where $l = c^2/a_0$, and

$$X_{\rm bimond} = l^2\,g^{\mu\nu}\left(C^\alpha_{\mu\beta}C^\beta_{\nu\alpha} - C^\alpha_{\mu\nu}C^\beta_{\beta\alpha}\right), \qquad (97)$$

where $C^\alpha_{\mu\nu} = \Gamma^\alpha_{\mu\nu} - \hat\Gamma^\alpha_{\mu\nu}$. The MONDian modification of gravity is thus introduced through the
interaction between the space-time on which matter lives and the auxiliary space-time (on which
some "twin matter" might live). This modification is acceleration-based since the interaction
involves the difference of Christoffel symbols, playing the role of acceleration. By varying the
action w.r.t. both metrics, we obtain two sets of Einstein-like equations, which boil down in the
static weak-field limit to $\Phi = -\Psi$ and $\hat\Phi = -\hat\Psi$ in Eq. 73 (so this yields the correct amount of
gravitational lensing for normal photons w.r.t. the "matter metric" $g_{\mu\nu}$), as well as the following
generalized Poisson equations:

$$\nabla^2\Phi = 4\pi G\rho + \nabla\cdot\left[f'(|\nabla(\Phi - \hat\Phi)|^2/a_0^2)\,\nabla(\Phi - \hat\Phi)\right] \quad {\rm and} \quad \nabla^2\hat\Phi = 4\pi G\hat\rho + \nabla\cdot\left[f'(|\nabla(\Phi - \hat\Phi)|^2/a_0^2)\,\nabla(\Phi - \hat\Phi)\right], \qquad (98)$$

or, equivalently,

$$\nabla^2(\Phi - \hat\Phi) = 4\pi G(\rho - \hat\rho) \quad {\rm and} \quad \nabla^2\Phi = 4\pi G\rho + \nabla\cdot\left[f'(|\nabla(\Phi - \hat\Phi)|^2/a_0^2)\,\nabla(\Phi - \hat\Phi)\right]. \qquad (99)$$

This is equivalent to QUMOND (Eq. 30) if the matter and twin matter are well separated (which is
natural if they repel each other), the function $f$ playing the role of $H$ in Eq. 34, with $f'(X_{\rm bimond}) \to 0$
for $X_{\rm bimond} \gg 1$ and $f'(X_{\rm bimond}) \to X_{\rm bimond}^{-1/4}$ for $X_{\rm bimond} \ll 1$. Note that the existence of
this putative twin matter is far from being necessary (putting $\hat\rho = 0$ everywhere yields exactly
QUMOND), but it might be suggested by the existence of the auxiliary metric within the theory.
Again, it is mandatory to stress that the formulation of BIMOND sketched above is actually far
from unique and can be suitably parametrized to yield a whole class of BIMOND theories with
various phenomenological behaviors [313, 318, 321]. For instance, in matter-twin matter symmetric
versions of BIMOND ($\alpha = \beta = 1$, see [321]), and within a fully symmetric matter-twin matter
system, a cosmological constant is given by the zero-point of the MOND function, naturally of 
the order of 1, thereby naturally leading to A ^ Aq for the large-scale Universe. Matter and twin 
matter would not interact at all in the high-acceleration regime, and would repel each other in the 
MOND regime (i.e., when the acceleration difference of the two sectors is small compared to $a_0$),
thereby possibly playing a crucial role in the Universe expansion and structure formation [317].
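
For an isolated point mass without twin matter, the weak-field BIMOND equations reduce to an algebraic relation between the Newtonian and the true acceleration, which the following Python sketch evaluates. The choice $f'(X) = X^{-1/4}$ at all $X$ is a toy assumption made here because it has the two limits quoted above; the function names, the value of $a_0$ and the sample mass are likewise illustrative.

```python
import math

G = 6.674e-11   # gravitational constant [m^3 kg^-1 s^-2]
A0 = 1.2e-10    # assumed value of Milgrom's constant [m s^-2]


def toy_f_prime(x):
    """Toy MOND function derivative: X^(-1/4), vanishing for X >> 1."""
    return x ** -0.25


def bimond_acceleration(mass, r, a0=A0):
    """Radial acceleration around a point mass with no twin matter.

    With the twin density set to zero, Phi - Phi_hat is the Newtonian potential,
    and in spherical symmetry the Poisson equation for Phi integrates to
    g = g_N * [1 + f'(g_N^2 / a0^2)].
    """
    g_newton = G * mass / r**2
    return g_newton * (1.0 + toy_f_prime((g_newton / a0) ** 2))


def circular_velocity(mass, r, a0=A0):
    """Circular velocity v = sqrt(r g) of a test particle at radius r."""
    return math.sqrt(r * bimond_acceleration(mass, r, a0))
```

With this toy function the acceleration is $g_N + \sqrt{g_N a_0}$, so the circular velocity tends to the constant $(GMa_0)^{1/4}$ at large radii while the Newtonian behavior is recovered, slowly, at small radii.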

This promising broad class of theories should be carefully theoretically investigated in the 
future, notably against the existence of ghost modes [70]. At a more speculative level, this class of
theories can be interpreted as a modification of gravity arising from the interaction between a pair 
of membranes: matter lives on one membrane, twin matter on the other, each membrane having 
its own standard elasticity but coupled to the other one. The way the shape of the membrane is 
affected by matter then depends on the combined elasticity properties of the double membrane, but 
matter response depends only on the shape of its home membrane. Interestingly, bimetric theories 
have also been advocated [257] to be a useful ingredient for the renormalizability of quantum
gravity (although they currently considered theories with only metric interactions, not derivatives 
like in BIMOND). 

7.9 Dipolar dark matter 

As we have seen, many relativistic MOND theories do invoke the existence of new "dark fields" 
(scalar or vector fields), which, if massive, can even sometimes truly be thought of as "dark matter" 
enjoying non-standard interactions with baryons (Sect. 7.6 and [83]). The bimetric version of
MOND (Sect. 7.8) also invokes the existence of a new type of matter, the "twin matter". This 
clearly shows that, contrary to common misconceptions, MOND is not necessarily about "getting 
rid of dark matter" but rather about reproducing the success of Milgrom's law in galaxies. It might 
require adding new fields, but the key point is that these fields, very massive or not, would not 
behave simply as collisionless particles. 

In a series of papers, Blanchet & Le Tiec [56, 57, 58, 59, 60, 61] have pushed further the idea that
the MOND phenomenology could arise from the fundamental properties of a form of dark matter
itself, by suggesting that dark matter could carry a space-like four-vector gravitational dipole
moment, following the analogy between Milgrom's law and Coulomb's law in a dielectric medium
proposed by [57] (see Eq. 9) or between the Bekenstein-Milgrom modified Poisson equation and
Gauss' law in terms of free charge density (see Eq. 17). The dark matter medium is described as
a fluid with mass current $J^\mu = \rho u^\mu$ (where $\rho$ is the equivalent of the mass density of the atoms in
a dielectric medium, i.e., it is the ordinary mass density of a pressureless perfect fluid, and $u^\mu$ is
the four-velocity of the fluid) endowed with the dipole moment vector $\xi^\mu$ (which will affect the
total density in addition to the above mass density $\rho$), with the following action [61]:



where P is the norm of the projection perpendicular to the four-velocity (not the norm of the 
polarization fielcjffl) of the polarization field = p^'^, and where the dot denotes the covariant 
proper time derivative. The specific dynamics of this dark matter fiuid will thus arise from the 
coupling between the current and the dipolar field (analogue to the coupling to an external polar- 
ization field in electromagnetism), as well as from the internal non- gravitational force acting on 
the dipolar dark particles and characterized by the potential W{P). Let us note that the normal 
matter action and the gravitational Einstein-Hilbert action are just the same as in GR. 

The equations of motion of the dark matter fluid are then gotten by varying the action w.r.t. 
the dipole moment variable and w.r.t. to the current J^, boiling down in the non-relativistic 

^■^ In the case of TeVeS and GEA theories, the dark fields do not really count as dark matter because their energy 
density is subdominant to the baryonic one. 

This is to be contrasted with the time-like nature of TeVeS and GEA vector fields in the static weak-field limit 
^*And the current is conserved, i.e., Vp = 

®^ It can be shown that only the projection perpendicular to the four-velocity enters the field equations deduced 
from the action of Eq. llOOl Thus the dipole moment is always fully space-like. 




(100) 



97 



limit to: 




(101) 



^ = f + -V[W{P) - PW'{P)] + (PV)g, 



(102) 



where v is the ordinary velocity of the fluid, g = — V<f> is the gravitational field, and f = 
— (P I P)W' I p is the internal non-gravitational force field making the dark particles motion non- 
geodesic. What is more, the Poisson equation in the weak-field limit is recovered as: 



$$\nabla \cdot (\mathbf{g} - 4\pi \mathbf{P}) = -4\pi G (\rho_b + \rho). \qquad (103)$$

In order to then reproduce the MOND phenomenology in galaxies, the next step is the so-called
"weak-clustering hypothesis", namely the assumption that, in galaxies, the dark matter fluid does
not cluster much ($\rho \ll \rho_b$) and is essentially at rest ($\mathbf{v} = 0$) because the
internal force of the fluid precisely balances the gravitational force, in such a way that the
polarization field $\mathbf{P}$ is precisely aligned with the gravitational one $\mathbf{g}$, and
$g \propto -W'(P)$. The potential thus plays the role of the "MOND function", and e.g. choosing to
determine it up to third order in expansion as

$$W(P) \propto \frac{\Lambda}{8\pi} + 2\pi P^2 + \frac{16\pi^2}{3 a_0} P^3 + \mathcal{O}(P^4) \qquad (104)$$

then yields the desired MOND behavior in Eq. [103] with the n = 1 "simple" $\mu$-function.

This model has many advantages. The monopolar density of the dipolar atoms $\rho$ will play the
role of CDM in the early Universe, while the minimum of the potential $W(P)$ naturally adds a
cosmological constant term, thus making the theory precisely equivalent to the $\Lambda$CDM model
for expansion and large scale structure formation. The dark matter fluid behaves like a perfect
fluid with zero pressure at first order cosmological perturbation around a FLRW background and thus
reproduces CMB anisotropies. Let us also note that, if the potential $W(P)$ defining the internal
force of the dipolar medium is to come from a fundamental theory at the microscopic level, one
expects that the dimensionless coefficients in the expansion all be of order unity after rescaling
by $a_0^2$, thus naturally leading to the coincidence $\Lambda \sim a_0^2$.

However, while the weak clustering hypothesis and stationarity of the dark matter fluid in
galaxies are supported by an exact and stable solution in spherical symmetry [59], it remains to
be seen whether such a configuration would be a natural outcome of structure formation within this
model. The presence of this stationary DM fluid being necessary to reproduce Milgrom's law in
stellar systems, this theory loses a bit of the initial predictability of MOND, and inherits a bit
of the flexibility of CDM, inherent to invoking the presence of a DM fluid. This DM fluid could,
e.g., be absent from some systems such as the globular clusters Pal 14 or NGC 2419 (see
Sect. 6.6.3), thereby naturally explaining their apparent Newtonian behavior. However, the weak
clustering hypothesis in itself might be problematic for explaining the missing mass in galaxy
clusters, due to the fact that the MOND missing mass is essentially concentrated in the central
parts of these objects (see Sect. 6.6.4).

7.10 Non-local theories and other ideas

All the models above somehow invoke the existence of new "dark fields", notably because
for local pure metric theories, the Hamiltonian is generically unbounded from below if the action
depends on a finite number of derivatives [82] [137] [442]. A somewhat provocative solution would
thus be to consider non-local theories. A non-local action could, e.g., arise as an effective
action due to quantum corrections from super-horizon gravitons [441]. Deffayet, Esposito-Farese &
Woodard [124] have notably exhibited the form that a pure metric theory of MOND could take
in order to yield MONDian dynamics and MONDian lensing for a static, spherically symmetric
baryonic source.

In such a static spherically symmetric geometry, the Einstein-Hilbert action can be
rewritten in the weak-field expansion as [124]:

$$S_{EH} = \text{surface term} + \frac{c^4}{16\pi G} \int d^4x \left[ -r a b' + \frac{a^2}{2} + \text{(cubic and higher-order terms)} \right], \qquad (105)$$

where $(1 + a)$ and $-(1 + b)$ are the weak-field $g_{rr}$ and $g_{00}$ components of the static
weak-field metric, respectively. The MOND modification to this action must admit as a solution in
the deep-MOND limit $a = r b' = 2 (G M a_0)^{1/2} / c^2$, where the first equality ensures that
lensing and dynamics are consistent, leading to the following tentative action in the
ultra-weak-field limit [124]:



$$S_{MOND} = [\ldots] \qquad (106)$$



where / = /a^ and a is an arbitrary constant. While it is impossible to express this form of the 
action as a local functional of a general metric, Deffayet et al. [124] showed that it was entirely
possible to do so in a non-local model, making use of the non-local inverse d'Alembertian and of a
TeVeS-like vector field, introduced not as an additional "dark field", but as a non-local functional
of the metric itself (by e.g. normalizing the gradient of the volume of the past light-cone). A whole
class of such models is constructible, and a few examples are given in [124], although stability
analyses are still needed for them.

As already mentioned in Sect. 6.1.1, this non-locality was also inherent to classical toy models of
"modified inertia". In GR, this would mean making the matter action of a point particle
depend on all derivatives of its position, but such models are very difficult to construct [301] and
no fully-fledged theory exists along these lines. A few interesting heuristic ideas have however been
proposed in this context. For instance, Milgrom [305] proposed that the inertial force in Newton's
second law could be defined to be proportional to the difference between the Unruh temperature and
the Gibbons-Hawking one. It is indeed well-known that, in Minkowski space-time, an accelerated
observer sees the vacuum as a thermal bath with a temperature proportional to the observer's
acceleration, $T_U = a h / (4\pi^2 k c)$ [471], where $h$ is the Planck constant and $k$ the
Boltzmann constant. On the other hand, a constant-accelerated observer in de Sitter space-time
(curved with a positive cosmological constant $\Lambda$) sees a non-linear combination of that
vacuum radiation and of the Gibbons-Hawking radiation (with temperature
$T_{GH} = (\Lambda/3)^{1/2} h / (4\pi^2 k)$ [175]) due to the cosmological horizon in the presence of
a positive $\Lambda$. Namely, the Unruh temperature of the radiation seen by such an accelerated
observer in de Sitter spacetime is [175] $T_U = [a^2 + c^2 \Lambda/3]^{1/2} h / (4\pi^2 k c)$. The
idea of Milgrom [305] is to then define the right-hand side of the norm of Newton's second law as
being proportional to the difference between the two temperatures:

$$|\mathbf{F}| \propto (T_U - T_{GH}), \qquad (107)$$

which trivially leads to $\mathbf{F} = m \mu(a/a_0) \mathbf{a}$ with $a_0 = c (\Lambda/3)^{1/2}$
(which is however observationally too large by a factor $2\pi$) and the interpolating function
$\mu(x)$ having the exact form of Eq. [54]. In short, observers experiencing a very small
acceleration would see an Unruh radiation with a small temperature close to the Gibbons-Hawking
one, meaning that the inertial resistance defined by the difference between the two radiation
temperatures would be smaller than in Newtonian dynamics, and thus the corresponding acceleration
would be larger. However, no relativistic version (if at all possible) of this approach has been
developed yet: a few difficulties arise due to the direction of the acceleration, or from the fact
that stars in galaxies are free-falling objects along geodesics, and not accelerated by a
non-gravitational force as in the case of basic Unruh radiation. It was interestingly noted [309]
that the de Sitter space-time could be seen as a 4-dimensional pseudo-sphere embedded in a
5-dimensional flat Minkowski space, and that the acceleration of a constant-accelerated observer in
this flat space would be exactly $a_5 = (a^2 + c^2 \Lambda/3)^{1/2}$. Then, MOND could arise from
symmetry arguments in this 5-dimensional space similar to those leading to special relativity in
Minkowski space [309]. Interestingly, arguments very similar to this whole vacuum radiation approach
have also recently been made in the context of entropic gravity [192] [193] [225] [477]. Finally,
another interesting idea to get MOND dynamics has been the tentative modification of special
relativity, making the Planck length and the length $\ell = \Lambda^{-1/2} \sim c^2/a_0$ two new
invariants in addition to the speed of light, an attempt known as Triply Special Relativity [234].
In any case, despite all these attempts, there is still no fully-fledged theory of MOND at hand
which would derive from first principles, and the quest for such a formulation of MOND continues.
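
As a purely numerical illustration of the heuristic of Eq. 107 (a sketch under stated assumptions,
not taken from [305]): writing a_dS for the de Sitter acceleration scale that enters the
Gibbons-Hawking temperature, the temperature difference is proportional to
sqrt(a^2 + a_dS^2) - a_dS. The Python snippet below evaluates the ratio of this quantity to a, i.e.
an effective inertia factor, as a function of x = a/a_dS. The function name `inertia_factor` and the
variable `x` are illustrative choices, and how a_dS maps onto a_0 depends on the normalization
adopted, which is deliberately not fixed here.

```python
import math


def inertia_factor(x: float) -> float:
    """Return [sqrt(a^2 + a_dS^2) - a_dS] / a as a function of x = a / a_dS.

    The expression is evaluated in the algebraically equivalent form
    x / (sqrt(1 + x^2) + 1), which avoids cancellation errors at small x.
    """
    if x <= 0:
        raise ValueError("x = a / a_dS must be positive")
    return x / (math.sqrt(1.0 + x * x) + 1.0)


if __name__ == "__main__":
    # Scan from the low-acceleration regime to the high-acceleration regime.
    for x in (1e-3, 1e-1, 1.0, 1e1, 1e3):
        print(f"x = {x:8.3g}   factor = {inertia_factor(x):.6f}")
```

For x much larger than 1 the factor tends to 1 (ordinary Newtonian inertia), while for x much
smaller than 1 it tends to x/2, i.e. it scales linearly with the acceleration, as expected of a
MOND interpolating function.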
8 Gravitational Lensing in Relativistic MOND



The viable MOND theories from the previous section, although still mostly effective, have the
great advantage of proving that constructing relativistic MOND theories is possible, and that it is
thus possible to calculate from them the effects of gravitational lensing. But the non-uniqueness of
the theories of course means that there is not really a unique prediction for gravitational lensing,
especially in heavily time-dependent configurations, or when the predictions of the theories for
the expansion history of the Universe deviate from the concordance model. As we have seen,
some theories also deviate slightly from classical MOND predictions for dynamics of quasi-static
systems, due to the presence of massive dark fields, and the same would of course happen for
gravitational lensing. However, at the zeroth order, and in static weak-field configurations, we can
make predictions for all theories whose expansion history would be similar to that of $\Lambda$CDM
(see Sect. 9.1) and whose static weak-field limit is represented by a physical metric with
$\Psi = -\Phi$ in Eq. [73] ($\Phi$ obeying Eq. [17]). This equality $\Psi = -\Phi$ in the weak-field
metric is put in by hand in all TeVeS-like theories (Sects. 7.2 to 7.6) through a disformal relation
between the Einstein and physical metrics, and is a generic prediction of GEA (Sect. 7.7), BIMOND
(Sect. 7.8) and DDM (Sect. 7.9) theories. In this case, the way the light propagates on the null
geodesics of this metric is exactly the same in all these theories once $\Phi$ is known. What
differs from GR is only the relation between $\Phi$ and the underlying mass distribution of the
lens.

8.1 Strong lensing by galaxies

When multiple images of a background source are produced by a gravitational lens, one talks about
strong lensing. In that case, most of the light bending occurs within a small range around the lens
compared to the lens-source distance $D_{ls}$ and the observer-source distance $D_s$ (where the
distances are the usual angular diameter distances in cosmology). In this so-called thin-lens
approximation, the resulting deflection angle can be written as:

$$\boldsymbol{\alpha} = \frac{2}{c^2} \int_{-\infty}^{\infty} \nabla_\perp \Phi \, dz, \qquad (108)$$

where $\Phi = -\Psi$ is the non-relativistic gravitational potential of Eq. [73] (obeying a MONDian
Poisson equation), and $\nabla_\perp$ denotes the two-dimensional gradient operator perpendicular to
light propagation. By "thin-lens approximation", we however do not mean that the MOND lensing can be
computed from the projected surface density on the lens-plane as in GR, because the convergence
parameter (Eq. [113] below) is not a measure of the projected surface density anymore. This stronger
statement is also sometimes referred to as the "thin-lens approximation" in GR, and is not valid
in MOND: two lenses with the same projected surface density can have different convergence
parameters, because lensing also depends strongly on the distribution of the source mass along the
line-of-sight in MOND.

The lens equation then relates the observed two-dimensional angular position of the image in the
lens plane $\boldsymbol{\theta}$ to the original angular position of the source in the source plane
$\boldsymbol{\beta}$ through:

$$\boldsymbol{\theta} = \boldsymbol{\beta} + \frac{D_{ls}}{D_s} \boldsymbol{\alpha}, \qquad (109)$$

where it appears clearly that the expansion history will play an important role in converting
redshifts to distances. It is also convenient to make the deflection angle $\boldsymbol{\alpha}$
derive from a deflection potential $\Upsilon$ in the lens-plane:

$$\Upsilon(\boldsymbol{\theta}) = \frac{2 D_{ls}}{c^2 D_s D_l} \int_{-\infty}^{\infty} \Phi(D_l \boldsymbol{\theta}, z) \, dz, \qquad (110)$$

where $D_l$ is the observer-lens distance. If a source is much smaller than the angular scale on
which the lens properties change, the lens equation Eq. [109] can locally be linearized as:

$$\boldsymbol{\beta}(\boldsymbol{\theta}) = \boldsymbol{\beta}_0 + A(\boldsymbol{\theta}_0) (\boldsymbol{\theta} - \boldsymbol{\theta}_0), \qquad (111)$$

where the inverse magnification matrix is

$$A(\boldsymbol{\theta}) = \frac{\partial \boldsymbol{\beta}}{\partial \boldsymbol{\theta}}, \quad \text{where } A_{11} = 1 - \kappa - \gamma_1, \; A_{12} = A_{21} = -\gamma_2, \; A_{22} = 1 - \kappa + \gamma_1. \qquad (112)$$

The convergence $\kappa$ is directly given by the Laplacian of the deflection potential $\Upsilon$:

$$\kappa = \frac{1}{2} \nabla^2 \Upsilon. \qquad (113)$$

The so-called Einstein radius is the radius within the lens-plane within which the mean convergence
is $\langle \kappa \rangle = 1$. The existence of a region where $\kappa$ is of that order is
sufficient to produce multiple images and is the definition of strong lensing. On the other hand,
the shear components $\gamma_1, \gamma_2$ are given by

$$\gamma_1 = \frac{1}{2} \left( \frac{\partial^2 \Upsilon}{\partial \theta_1^2} - \frac{\partial^2 \Upsilon}{\partial \theta_2^2} \right), \quad \gamma_2 = \frac{\partial^2 \Upsilon}{\partial \theta_1 \partial \theta_2}. \qquad (114)$$

Due to Liouville's theorem, gravitational lensing preserves the surface brightness, but it changes
the apparent solid angle of a source. The resulting flux ratio between image and source can be
expressed in terms of the magnification $M$,

$$M^{-1} = (1 - \kappa)^2 - \gamma_1^2 - \gamma_2^2. \qquad (115)$$

The flux ratio between two images A and B is $f_{AB} = M_A / M_B$. Let us finally note that (i) the
time-delay between the different images can be deduced directly from the lensing potential and
depends on the Hubble constant and convergence at the Einstein radius, and that (ii) points in the
lens plane where $M^{-1} = 0$ (infinite magnification) form closed curves called the critical
curves. Their corresponding curves located in the source plane are called caustics. The location of
the source with respect to caustics determines the number of images, a source outside of the
outermost caustic producing only one image while each caustic crossing changes the number of images
by two. Spherically symmetric models of galaxy lenses can never produce observed quadruple-imaged
systems because the innermost caustic of spherical models degenerates into a point.
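
The definitions of Eqs. 112 to 115 can be illustrated with a short Python sketch. It assumes, purely
for illustration, an isothermal-like deflection potential proportional to the angular distance from
the lens centre, theta_E * |theta|; this toy potential, the function names and the parameter
`theta_e` are assumptions introduced here and are not taken from the text. The second derivatives of
this potential are known analytically, and from them one obtains the convergence, the two shear
components and the inverse magnification.

```python
import math


def toy_second_derivatives(t1: float, t2: float, theta_e: float):
    """Second derivatives of the assumed toy potential theta_e * |theta|.

    Returns (d2/dt1^2, d2/dt2^2, d2/dt1dt2) at angular position (t1, t2).
    """
    r = math.hypot(t1, t2)
    if r == 0.0:
        raise ValueError("derivatives are singular at the lens centre")
    r3 = r ** 3
    return theta_e * t2 * t2 / r3, theta_e * t1 * t1 / r3, -theta_e * t1 * t2 / r3


def lensing_quantities(d11: float, d22: float, d12: float):
    """Convergence, shear components and inverse magnification (Eqs. 113-115)."""
    kappa = 0.5 * (d11 + d22)           # half the Laplacian of the potential
    gamma1 = 0.5 * (d11 - d22)
    gamma2 = d12
    inv_mag = (1.0 - kappa) ** 2 - gamma1 ** 2 - gamma2 ** 2
    return kappa, gamma1, gamma2, inv_mag


if __name__ == "__main__":
    theta_e = 1.0  # angular scale of the toy lens, arbitrary units
    for position in ((1.0, 0.0), (0.0, 2.0), (3.0, 4.0)):
        kappa, g1, g2, inv_mag = lensing_quantities(
            *toy_second_derivatives(*position, theta_e)
        )
        print(position, kappa, g1, g2, inv_mag)
```

For this toy potential the inverse magnification reduces to 1 - theta_E/|theta|, so it vanishes on
the circle |theta| = theta_E, which is then the critical curve of the model.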

As outlined above, what differs from GR in all the relativistic MOND theories is the
relation between the non-relativistic potential $\Phi$ and the underlying mass distribution of the
lens $\rho$. However, different theories yield slightly different relations between $\Phi$ and
$\rho$ in the weak-field limit (see especially Sect. 6.1 and Sect. 6.2). For instance, while GEA
theories (Sect. 7.7) boil down to Eq. [17] in the static weak-field limit, TeVeS (Sect. 7.4) leads to
the situation of Eq. [40] and BIMOND (Sect. 7.8) to Eq. [30]. However, like in the case of rotation
curves (see Figure [20]), the differences are only minor outside of spherical symmetry (and null in
spherical symmetry), and the global picture can be obtained by assuming a relation given by the BM
equation (Eq. [17]).
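
To make the role of Eq. 108 concrete, the following Python sketch integrates the transverse
gravitational field along the line of sight for a point-mass lens. It is a minimal illustration
under several assumptions that are not taken from the text: spherical symmetry (so that the modified
Poisson equation reduces to an algebraic relation between the MOND and Newtonian accelerations), the
"simple" interpolating function mu(x) = x/(1+x), a value a_0 = 1.2e-10 m/s^2, rounded physical
constants, and hypothetical function names. The substitution z = b tan(phi) maps the infinite line
of sight onto a finite interval, which is then integrated with a midpoint rule.

```python
import math

G = 6.674e-11    # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8      # speed of light, m s^-1
A0 = 1.2e-10     # assumed value of the MOND acceleration constant, m s^-2
MSUN = 1.989e30  # solar mass, kg


def g_simple(r: float, mass: float) -> float:
    """MOND acceleration of a point mass for mu(x) = x / (1 + x).

    Solving mu(g/a0) * g = g_N for g gives g = g_N/2 + sqrt(g_N^2/4 + g_N*a0).
    """
    gn = G * mass / r ** 2
    return 0.5 * gn + math.sqrt(0.25 * gn * gn + gn * A0)


def deflection_angle(b: float, mass: float, n: int = 20000) -> float:
    """Deflection angle (radians) at impact parameter b, following Eq. 108.

    With z = b*tan(phi) and r = b/cos(phi), the transverse field g*(b/r)
    integrated over z becomes b * g(r) / cos(phi) integrated over phi.
    """
    dphi = math.pi / n
    total = 0.0
    for i in range(n):
        phi = -0.5 * math.pi + (i + 0.5) * dphi  # midpoints avoid cos(phi) = 0
        cos_phi = math.cos(phi)
        total += g_simple(b / cos_phi, mass) / cos_phi * dphi
    return 2.0 * b * total / C ** 2


if __name__ == "__main__":
    mass = 1e11 * MSUN                     # illustrative galaxy-scale lens mass
    r_t = math.sqrt(G * mass / A0)         # transition radius of the lens
    for factor in (0.01, 0.1, 1.0, 10.0, 100.0):
        alpha = deflection_angle(factor * r_t, mass)
        print(f"b = {factor:6.2f} r_t   alpha = {alpha:.3e} rad")
```

At impact parameters much smaller than the transition radius the result approaches the familiar
4GM/(c^2 b) of GR, while at large impact parameters it tends to the constant value
2 pi (G M a_0)^(1/2) / c^2, the lensing counterpart of a flat rotation curve.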

The first studies of strong lensing by galaxies in relativistic MOND theories [94] [502] [508] made
use of the CfA-Arizona Space Telescope Lens Survey (CASTLES) and made a one-parameter fit
of the lens mass to the observed size of the Einstein radius, both for point-mass models and for
Hernquist spheres (with observed core radius). Zhao et al. [508] also compared the predicted and
observed flux ratios $f_{AB}$. They used the $\alpha = 0$ $\mu$-function of Eq. [46] and concluded
that reasonably good fits could be obtained with a lens mass corresponding to the expected baryonic
mass of the lens. Shan et al. [420] then improved the modeling method by considering analytic
non-spherical models with locally spherically symmetric isopotentials on both sides of the symmetry
plane z = 0, implying no curl field correction (S = 0) in Eq. [19]. The MOND non-relativistic
potential $\Phi$ can then analytically be written, and using Eq. [108] one can analytically compute
the two components $\alpha_1$ and $\alpha_2$ of the deflection angle vector $\boldsymbol{\alpha}$ as
a function of the three parameters of the model, namely the lens-mass and two scale-lengths
controlling the extent and flattening of the lens (see Eq. 18 of [420]). Using the lens equation
above (Eq. [109]), one can then trace back light-rays for each observed image to the source plane
and fit the lens parameters as well as its inclination in order for the source position to be the
same for each image. The quality of the fit is thus quantified by the squared sum of the source
position differences. This notably allowed [420] to fit in MOND the famous quadruple-imaged system
Q2237+030 known as the Einstein cross (see Figure [41]), a quasar gravitationally lensed by an
isolated bulge-disk galaxy [198]. For three other quadruple-imaged systems of the CASTLES survey,
the fits were however less successful, mostly because of the intrinsic limitations of the analytic
model of Shan et al. [420] at reproducing at the same time both a large Einstein radius and a large
shear. What is more, it does not take into account the effects of the environment in the form of an
external shear, which is also often needed in GR to fit quadruple-imaged systems. For 10 isolated
double-imaged systems in the CASTLES survey, the fits were much more successful. (Note that, in
order for the problem to be well constrained, a regularization method was used in order to penalize
solutions deviating from the fundamental plane as well as face-on solutions and solutions with an
anomalous flux ratio or M/L ratio; see Eq. 21 of [420].) For non-isolated systems however,
especially for those lenses residing in groups or clusters, the need for an external shear might be
coupled to a need for dark mass on galaxy group scales (see Sect. 6.6.4 and Sect. 8.3).

Figure 41: (a) The four images of the quasar Q2237+030 (known as the Einstein cross),
gravitationally lensed by an isolated bulge-disk galaxy known as Huchra's lens [198]. © ESA's faint
object camera on HST. (b) The empty squares denote the four observed positions of the images,
and the filled square denotes the MOND-fit unique position of the source [420]. The critical curves
for which $M^{-1} = 0$ in the lens plane are displayed in black, and their corresponding caustics in
the source plane in red.

Due to the fact that all the above models were using the so-called Bekenstein $\mu$-function
($\alpha = 0$ in Eq. [46]), and that this function has a tendency of slightly underpredicting
stellar mass-to-light ratios in galaxy rotation curve fits [146], it was claimed that this was a
sign for a MOND missing mass problem in galaxy lenses [153] [154] [263]. While such a missing mass
is indeed possible, and even corroborated by some dynamical studies [365] of galaxies residing
inside clusters (i.e., the small-scale equivalent of the problem of MOND in clusters), for isolated
systems with well-constrained stellar mass-to-light ratio, the use of the simple $\mu$-function
($\alpha = 1$ in Eq. [46]) has on the contrary been shown to yield perfectly acceptable fits [95] in
accordance with the lensing fundamental plane [398].

Finally, the probability distribution of the angular separation of the two images in a sample
of lensed quasars has been investigated by Chen [91] [92]. This important question has proved
somewhat troublesome for the $\Lambda$CDM paradigm, but is well explained by relativistic MOND
theories.

8.2 Weak lensing by galaxies 

A gravitational lens does not only produce multiple images close to caustics, but also weakly
distorted images (arclets) of other background sources. The weak and noisy signals from several
individual arclets (not necessarily detected by eye, but rather numerically exploited with the help
of image analysis) can be averaged by statistical techniques to get the shear components
$\gamma_1$ and $\gamma_2$ in Eq. [114] from the mean ellipticity of the images. One can then get the
convergence $\kappa$ from the azimuthal average of the tangential component of the shear. This is
what is known as weak lensing. In the case of galaxy-galaxy weak lensing, since the gravitational
distortions induced by an individual lens are too small to be detected, one has to resort to the
study of the ensemble averaged signal around a large number of lenses. This has been investigated in
the context of MOND for a sample of relatively isolated galaxy-lenses, stacked by luminosity ranges
[457]. The derived MOND masses were obtained by fitting a point mass model to the lensing data
within a distance of 200 kpc from the lens. While the MOND masses are perfectly compatible with the
baryonic masses in all galaxies less luminous than $10^{11} L_\odot$, it was found that the required
MOND mass-to-light ratios tended to be slightly too high ($M/L \sim 10$) for the most massive and
luminous galaxies ($L > 10^{11} L_\odot$). However, this whole result is driven by a single data
point: the fitted curve is strongly "pulled up" by the first point alone, so that all the other data
points lie below the "best fit". The mass-to-light ratios could thus easily be scaled down
by a factor of two, making these galaxies in perfect agreement with MOND. But it is also worth
noting that due to the very large distances probed, some weakly clustering residual
mass (hot dark matter, or some sort of "dark field" in the relativistic MOND theories) could start
playing a role at these distances. While ordinary neutrinos are still too weakly clustering, a
slightly more massive fermion such as a 10 eV-scale sterile neutrino could cluster on these scales,
and of course, baryonic dark matter in the form of dense molecular gas clouds could also
be present around these very massive objects (see Sect. 6.6.4).

Also related to weak lensing, it is important to recall that the "phantom dark matter" of
MOND (Eq. [33]) can sometimes become negative in cones perpendicular to the direction of the
external gravitational field in which a system is embedded: with accurate enough weak-lensing data,
detecting these pockets of negative phantom densities around a sample of non-isolated galaxies
could in principle be a smoking gun for MOND [491], but such an effect would be extremely
sensitive to the detailed distribution of the baryonic matter, and finding a sample of galaxies with
similar gravitational environments would also be extremely difficult.

8.3 Strong and weak lensing by galaxy clusters 

Gravitational lensing is a complementary technique to the hydrostatic equilibrium of the X-ray
emitting gas (Sect. 6.6.4) to probe the mass distribution of galaxy clusters. Since clusters are the
most recently formed structures, they could be slightly out of equilibrium, which makes
gravitational lensing extremely interesting as this technique is fully independent from the relaxed
or unrelaxed nature of the lens. A famous example of such a clearly unrelaxed object is the cluster
1E0657-56, known as the Bullet Cluster (Figure [42]). It is actually a pair of clusters which
collided at high speed (> 3100 km/s) at z = 0.3. In the collision, the dissipational hot X-ray
emitting gas which dominates the baryonic matter was separated from the collisionless galaxies
(which contribute negligibly to the baryonic mass) and from any presumed collisionless dark matter.
Using background galaxies to map the shear field, the convergence map of the cluster was provided
by [103], a convergence very conspicuously centered where the collisionless dark matter should be.
It would appear difficult to reconstruct such a configuration merely by modifying gravity, but the
non-linearity of MOND does not guarantee that the convergence from a two-center baryonic
distribution would indeed be centered on the two centers. Indeed, while the linear relation between
the matter density and the gravitational potential implies that the convergence parameter is a
direct measurement of the projected surface density in the weak-field limit of GR, this is not the
case anymore in MOND due to the non-linearity of the modified Poisson equation. Actually, it has
been shown that, in MOND, it is possible to have a non-zero convergence along a line of sight where
there is zero projected matter [15]. What is more, the gravitational environment might play an
important role on the internal gravitational field too [114] [260], and the additional degrees of
freedom of the various relativistic theories might play a non-negligible role, especially in
non-static situations [113]. Neglecting possible effects of the gravitational environment and
non-trivial features of the additional fields of the relativistic theories out of equilibrium, i.e.,
simply assuming that the physical metric is given by $\Psi = -\Phi$ in Eq. [73] and that $\Phi$
obeys Eq. [17], a MOND model of the bullet cluster was produced [17], in which a parametrized
potential was fitted to the convergence map to then determine the underlying mass distribution from
Eq. [17]. The result is displayed on Figure [43], and exactly the same conclusion was reached by
going from the baryonic density to the convergence map [148]. The main conclusions are that (i) the
amount of residual missing mass needed to account for the convergence map of the bullet cluster is
the same as in all other clusters (Sect. 6.6.4 and [450]), but that (ii) if it is made of dark
baryons, they must be in a collisionless form, since the residual missing mass is centered on the
collisionless galaxies and not on the dissipational hot gas. The dense molecular gas clouds
proposed by Milgrom [311] (see discussion in Sect. 6.6.4) satisfy this criterion, and would mostly
behave like individual stars. Like in most clusters with T > 4 keV, ordinary neutrinos with a 2 eV
mass would be broadly sufficient to account for the missing mass deduced from weak lensing (and,
obviously, heavier exotic hot dark matter particles such as 10 eV sterile neutrinos would do the job
too).

For TeVeS (Sect. 7.4) and GEA (Sect. 7.7), the growth of the spatial part of the vector
perturbation in the course of cosmological evolution can successfully seed the growth of baryonic
structures, just as dark matter does, and it is possible to reconstruct the gravitational field of
the bullet cluster without any extra matter but with a substantial contribution from the vector
field. However, why the dynamical evolution of the vector field perturbations would lead to
precisely such a configuration remains unclear. Similarly, the massive scalar field of Sect. 7.6 or
the monopolar part of the dipolar DM of Sect. 7.9 could in principle provide the off-centered
missing mass too, but again, why they would appear distributed as they do remains unclear,
especially in the case of dipolar DM which is supposed to cluster only very weakly, and in principle
not to appear as densely clustered. Whether the twin matter of BIMOND (Sect. 7.8) could help
provide the right convergence map also remains to be seen, while for non-local models (Sect. 7.10),
there is a strong dependence on the past light-cone, meaning that recently disturbed systems such
as the Bullet may be far from the static MOND limit (but in that case, it would not be clear why all
the other clusters from Sect. 6.6.4 exhibit the same amount of residual missing mass). So, while the
bullet cluster clearly does not represent the MOND-killer that it was supposed to be, explaining
its convergence map remains an outstanding challenge for all MOND theories. However, the bullet
cluster also represents an outstanding challenge to $\Lambda$CDM (see Sect. 4.2), due to its high
collision speed [250]. In that respect, MOND is much more promising [16].

On the other hand, a comprehensive weak lensing mass reconstruction of the rich galaxy cluster 
C10024-f 17 aX z — 0.4 [211] has been argued to have revealed the first dark matter structure that 
is offset from both the gas and galaxies in a cluster. This structure is ringlike, located between 

^^Note, however, that this is not always the case in colUding clusters: Abell 520 actually provides a counter- 
example to the bullet cluster in which the mass peaks indicated by weak lensing do not behave as collisionless 
matter should 212l . 
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r ^ 60" and r ^ 85". It was, again, argued to be the result of a collision of two massive clusters 
1-2 Gyr in the past, but this time along the line-of-sight. It has also been argued [211] that this 
offset was hard to explain in MOND. Assuming that this ringlike structure is real and not caused 
by instrumental bias or spurious effects in the weak lensing analysis (due, e.g., to the unification 
of strong and weak- lensing or to the use of spherical/circular priors), and that cluster stars and 
galaxies do not make up a high fraction of the mass in the ring (which would be too faint to observe 
anyway), it has been shown that, for certain interpolating functions with a sharp transition, this 
is actually natural in MOND |326j . A peak in the phantom dark matter distribution generically 
appears close to the transition radius of MOND rt = {GM / a^Y^"^ , especially when most of the mass 
of the system is well-contained inside this radius (which is the case for the cluster C10024-I-17). This 
means that the ring in C10024-I-17 could be the first manifestation of this pure MOND phenomenon, 
and thus be a resounding success for MOND in galaxy clusters. However, the sharpness of this 
phantom dark matter peak strongly depends on the choice of the /i-function, and for some popular 
ones (such as the "simple" /Lt-function) the ring cannot be adequately reproduced by this pure 
MOND phenomenon. In this case, a coUisional scenario would be needed in MOND too, in order 
to explain the feature as a peak of cluster dark matter. Indeed, we already know that there is a 
mass discrepancy in MOND clusters, and we know that this dark matter must be in collisionless
form (e.g., neutrinos or dense clumps of cold gas). So the results of the simulation with purely
collisionless dark particles [211] would surely be very similar in MOND gravity. Again, it was 
shown that the density of missing mass was compatible with 2 eV ordinary neutrinos, like in most 
clusters with $T > 4$ keV [140]. Finally, let us note that strong lensing was also recently used as a
robust probe of the matter distribution on scales of 100 kpc in galaxy clusters, especially in the
cluster Abell 2390 [150]. A residual missing mass was again found, compatible with the densities
provided by fermionic hot dark matter candidates only for masses of ~ 10 eV and heavier. All 
in all, the problem posed by gravitational lensing from galaxy clusters is thus very similar to the 
one posed by the temperature profiles of their X-ray emitting gas (Sect. 6.6.4), and remains one 
of the two main current problems of MOND, together with its problem at reproducing the CMB 
anisotropies (see Sect. 9.2).
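
To give a feeling for the scale at which such a phantom dark matter peak would sit, the transition radius $r_t = (GM/a_0)^{1/2}$ can be evaluated numerically. The following is a minimal sketch only: the value $a_0 = 1.2 \times 10^{-10}$ m s$^{-2}$ is the commonly quoted one and is an assumption here, the enclosed mass of $10^{14}$ solar masses is a purely hypothetical illustration (not a measurement of Cl0024+17), and the function name is invented for this example.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30   # solar mass, kg
KPC = 3.0857e19    # one kiloparsec, m
A0 = 1.2e-10       # MOND acceleration constant a_0, m s^-2 (commonly quoted value; assumption)


def transition_radius_kpc(mass_msun, a0=A0):
    """Return r_t = sqrt(G M / a_0) in kpc for an enclosed mass given in solar masses."""
    r_t_m = math.sqrt(G * mass_msun * M_SUN / a0)
    return r_t_m / KPC


if __name__ == "__main__":
    # Hypothetical enclosed mass, chosen only to illustrate the order of magnitude.
    mass = 1e14
    print(f"r_t for M = {mass:.1e} M_sun: {transition_radius_kpc(mass):.0f} kpc")
```

Because $r_t$ scales as $M^{1/2}$, quadrupling the enclosed mass only doubles the transition radius, which is why the position of the peak is a fairly robust feature once the mass is well contained inside $r_t$.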

Finally, let us note in passing that another (non-lensing) test of relativistic MOND theories 
in galaxy clusters has been performed by analysing the gravitational redshifts of galaxies in 7800 
galaxy clusters [490], which were originally found to be difficult to reconcile with MOND: however, 
this original analysis assumed a distribution of residual missing mass in MOND by simply scaling 
down the Newtonian dynamical mass represented by a NFW halo by a factor 0.8, and the analysis 
confused the interpolating functions $\mu(x)$ and $\tilde{\mu}(s)$ (see Sect. 6.2). A subsequent analysis [H]
showed that these gravitational redshifts were in accordance with relativistic MOND when the
correct residual mass and acceptable $\mu$-functions were used.
Figure 42: The bullet cluster 1E0657-56. The hot gas stripped from both subclusters after the
collision is colored red-yellow. The green and white curves are the isocontours of the lensing
convergence parameter $\kappa$ (Eq. 113). The two peaks of $\kappa$ do not coincide with those of the gas,
which makes up most of the baryonic mass, but are skewed in the direction of the galaxies. The
white bar corresponds to 200 kpc. Figure courtesy of D. Clowe.
Figure 43: A MOND model of the bullet cluster [17]. The fitted $\kappa$-map (solid black lines) is
overplotted on the convergence map of [103] (dotted red lines). The four centres of the parametrized
potential used are the red stars. Also overplotted (blue dashed line) are two contours of surface
density. Note slight distortions compared to the contours of $\kappa$. The green shaded region corresponds
to the clustering of 2 eV neutrinos. Inset: The surface density of the gas in the model of the bullet
cluster.
8.4 Weak lensing by large-scale structure

The weak-lensing method can also be applied on larger scales, i.e., mapping the shear field induced
by large-scale structures. On these scales, the metric of the expanding Universe forming structure
is well represented by a Newtonianly perturbed Friedmann-Lemaître-Robertson-Walker (FLRW)
metric:

$$g_{0i} = g_{i0} = 0, \qquad g_{00} = -(1 + 2\Phi), \qquad g_{ij} = a(t)^2 (1 + 2\Psi)\,\delta_{ij}, \tag{116}$$

where $a(t)$ is the scale factor. Like in the static weak-field case (Eq. 73), $\Phi$ is the non-relativistic
potential in units of $c^2$, but the equality $\Psi = -\Phi$ in Eq. 73 does not necessarily imply the equality
in Eq. 116. In GR, this equality is actually respected in both cases (apart from perturbations
around a FLRW background sourced by anisotropic stress), but the relativistic MOND theories,
which have been constructed in order to yield the equality for the static weak-field limit in Eq. 73,
do not harbor this equality in the perturbed FLRW case, and the quantity $\Phi + \Psi$ is referred to as
the gravitational slip. For instance, in the TeVeS (Sects. 7.3 and 7.4) and GEA (Sect. 7.7) theories,
based on unit-norm vector fields, the equality is broken due to the growth of vector perturbations
in the course of cosmological evolution (see e.g. [129] and Sect. 9.2).
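
As a brief worked illustration of the notation in Eq. 116 (a sketch that only restates the definitions given in the text, with no additional physical input), the two combinations of potentials reduce as follows whenever the static relation $\Psi = -\Phi$ also holds in the perturbed FLRW case:

```latex
\begin{align}
  \Psi = -\Phi \quad &\Longrightarrow \quad \Phi + \Psi = 0
      \quad \text{(vanishing gravitational slip)}, \\
  \Psi = -\Phi \quad &\Longrightarrow \quad \Phi - \Psi = 2\Phi .
\end{align}
```

Conversely, a non-zero slip $\Phi + \Psi$ implies $\Phi - \Psi \neq 2\Phi$, so that the combination $\Phi - \Psi$ and the potential $\Phi$ itself carry independent information.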

Like in the static case, weak gravitational lensing from large-scale structure will actually depend
on $\Phi - \Psi$, whereas galaxy clustering will arise only from the non-relativistic potential $\Phi$. By
combining information on the matter overdensity at a given redshift (obtained from measuring the
peculiar velocity field) and on the weak lensing maps, Zhang et al. [499] proposed a clever method
to observationally estimate $\Phi - \Psi$. This allowed Reyes et al. [363] to use luminous red galaxies in
the SDSS survey in order to exclude one model from the original TeVeS theory (Sect. 7.3) with the
original $f(X)$ function of [31], thus explicitly showing how such measurements could be a possible
future smoking-gun for all theories based on dynamical vector fields. But note that other MOND
theories such as BIMOND would not be affected by such measurements.

Let us however finally note a caveat in the interpretation of the weak lensing shear map in the 
context of relativistic MOND. While intercluster filaments negligibly contribute to the weak lensing 
signal in GR, a single filament inclined by $\pi/4$ from the line of sight can cause substantial distortion
of background sources pointing toward the filament's axis in relativistic MOND theories [149] . Since 
galaxies are generally embedded in filaments or are projected on such structures, this contribution 
should be taken into account when interpreting weak lensing data. This additional difficulty for 
interpreting weak-lensing data in MOND is not only true for filaments, but more generally for all 
low-density structures such as sheets and voids.
9 MOND and Cosmology



9.1 Expansion history 

A viable theory of modified gravity, including dark fields or not, should not only be able to 
reproduce observations in quasi-stationary galactic and extragalactic systems, but also to reproduce 
all of the major probes of observational cosmology, including (i) the Hubble diagram out to large 
z, (ii) the anisotropies in the cosmic microwave background (CMB), and (iii) the matter power
spectrum on large scales. The first requires a detailed knowledge of FLRW cosmology, and the last 
two a knowledge of cosmological perturbations on a FLRW background. 

Concerning the first point, the FLRW solutions have been extensively studied for TeVeS
(Sects. 7.3 and 7.4, see e.g. [71]) and GEA (Sect. 7.7, see e.g. [516]) theories, for BIMOND (Sect. 7.8,
see e.g. [102]), and for theories based on dipolar dark matter (Sect. 7.9, see e.g. [ST]). In the latter
case, the theory [59] [ST] has been shown to be strictly equivalent to ΛCDM out to first order
cosmological perturbations (but very different in the galaxy formation regime), together with a
natural explanation for $\Lambda \sim a_0^2$. For the other theories, it has been shown that the contribution of
the extra fields to the overall expansion is subdominant to the baryonic mass and does not affect
the overall expansion [152]. Such theories can predict an extremely wide range of cosmological
behavior, ranging from accelerated expansion to contraction on a finite time scale [71]. The key
point is that the expansion history mainly depends on the form of the "MOND function" $f(X)$ for
the unconstrained domain $X < 0$ in any of these theories.

For instance in TeVeS, $X \propto (\nabla\phi)^2 > 0$ in static configurations (see Eq. 55), and $X \propto
-2(\partial\phi/\partial t)^2$ in evolving homogeneous and isotropic configurations such as the expanding Universe.
The form of $f(X)$ is clearly constrained from the MOND phenomenology only for $X > 0$, meaning
that a lot of freedom exists for $X < 0$. Exactly the same is true in GEA and BIMOND theories
[102]. For instance, Bekenstein [34] originally proposed for TeVeS an $f'$-function (corresponding
to $\tilde{\mu}$, see Eq. [7^) with a discontinuity at $X = 0$ (the B04 function on Figure [TT]) not enabling galaxies
to collapse continuously out of the Hubble expansion. Afterwards, Zhao & Famaey proposed an
improved "mirror-function" $f'(X)$ such that the corresponding $\tilde{\mu}$-function reproduces the simple
$\mu$-function ($\alpha = 1$ in Eq. 46) for $X > 0$, and $f(X) = f(-X)$ for the cosmological regime $X < 0$ (see
Figure 44), leading to an acceptable expansion history. However, when connecting a static galaxy
to the expanding Universe, the limit $\tilde{\mu}(0) \sim 0$ would predict the existence of a singular surface
around each galaxy on which the scalar degree of freedom does not propagate, meaning that it is
better to reconnect the two sides at $\tilde{\mu}(0) \sim \epsilon$ (see Sect. 6.2). In addition, the integration constant
$f(0)$ can play the role of the cosmological constant [185] to drive accelerated expansion, but even
some $f(0) = 0$ models can drive late-time acceleration [126], which is not surprising since k-essence
scalar fields were also introduced to address the dark energy problem. In the case of BIMOND (see
Sect. 7.8), a symmetric matter-twin matter early Universe yields a cosmological constant through
the zero-point of the MOND function, thereby naturally leading to $\Lambda \sim a_0^2$.

All in all, with the additional freedom of a hypothetical dark component in the matter sector,
in the form of e.g. ordinary or sterile neutrinos, playing with the form of $f(X)$ for $X < 0$ in
TeVeS, GEA and BIMOND always allows one to reproduce an expansion history and a Hubble
diagram almost precisely identical to ΛCDM, justifying the assumption of this expansion history
made in Sect. 8 for gravitational lensing in relativistic MOND. However, it is important to
note that MOND theories do not provide a unique prediction on this.

9.2 Large scale structure and Cosmic Microwave Background 

Modified gravity theories should of course not only produce a reasonable Hubble expansion but
also reproduce the observed anisotropies in the CMB, and the matter power spectrum. Taken
at face value, these require not only dark matter, but non-baryonic cold dark matter. Any
alternative theory must account for these, just as dark matter models need to explain galaxy scale
phenomenology.

Figure 44: In solid blue, the Zhao-Famaey [509] $\tilde{\mu}(s)$-function (Eq. 79) of TeVeS (Sects. 7.3 and
7.4), compared to the original Bekenstein one (dashed green) with a discontinuity at $s = 0$ [34].
The ZF function provides a more natural transition from static systems (the positive side) to
cosmology (the negative side).

Using the hypothesis that the Universe is filled with some form of cold dark matter, it is possible 
to simultaneously fit observations of the CMB [230] and provide an elegant picture for the growth 
of large scale structure [445] . An obvious question is thus how MOND fares with these subjects. Of 
course, as we have seen, there is no unique existing MOND theory (Sect. 7), and the basic theory 
underlying MOND as a paradigm is probably yet to be found. Nevertheless, we can make a few 
general considerations about how any MOND theory should behave, and then look in more detail
at specific predictions from existing relativistic theories. The general picture is that, in some ways
MOND does surprisingly well, in others it clearly gives no unique prediction as yet, and in
still others it appears to fail outright.

If one alters the force law as envisioned by MOND, the effective long range force becomes 
stronger. Though details will of course depend on the specific relativistic theory, we can speculate 
about the consequences of a MOND-like force in cosmology. Note however that most of what follows 
cannot be rigorously justified at the moment for lack of a compelling unique underlying theory. 
But obviously, because of the stronger force, dynamical measures of the cosmic mass density will 
be overestimated, just as in galaxies. Applying MOND to the peculiar motions of galaxies yields
$\Omega_m \approx \Omega_b$ [279]. There are large uncertainties in estimating the extragalactic peculiar acceleration
field, so this merely shows that MOND might alleviate the need for non-baryonic dark matter
inferred conventionally from $\Omega_m > \Omega_b$.

The stronger effective gravitational attraction of MOND would change the growth rate of 
perturbations. Instead of adding dark mass to speed the growth of structure, we now rely on the 
modified force law to do the work. While it is obvious that MOND will form structures more rapidly 
than conventional gravity with the same source perturbation, we immediately encounter a challenge 
posed by the non-linear nature of the theory, precluding an easy linear perturbation analysis. 
One can nevertheless sketch a naive overview of how structure might form under the influence of 
MOND. The following picture emerges from numerical calculations of particles interacting under 
MOND in an assumed background [386, 342, 227, 251], and is thus obviously slightly (or very)
different from the various relativistic MOND theories of Sect. 7 and from those yet to be found,
especially from those MONDian theories involving the existence of some form of dark matter (twin
matter, dipolar dark matter, etc.). In the early Universe, perturbations cannot grow because
the baryons are coupled to the photon fluid. The mass density is lower, so matter domination
occurs later than in ΛCDM. Consequently, MOND structure formation initially has to lag behind
ΛCDM at very high redshift ($z > 200$). However, as the influence of the photon field declines
and perturbations begin to enter the MOND regime, structure formation rapidly speeds up. Large
galaxies may form by $z \approx 10$ and clusters by $z \approx 2$ [TT] [386], considerably earlier than in ΛCDM.
By $z = 0$, the voids have become more empty than in ΛCDM, but otherwise simulations (of
collisionless particles, which is of course not the best representation of the baryon fluid) show
the same qualitative features of the cosmic web [227, 251]. This similarity is not surprising since
MOND is a subtle alteration of the force law. The chief difference is in the timing of when
structures of a given mass appear, it being easier to assemble a large mass early in MOND. This
means that MOND is promising in addressing many of the challenges of Sect. 4.2, namely the
high-z clusters challenge [11] and Local Void challenge, as well as the bulk flow challenge and high
collisional velocity of the bullet cluster [TO] [252], again due to the much larger than Newtonian
MOND force in the structure formation context. What is more, it could allow large massive
galaxies to form early ($z \approx 10$) from monolithic dissipationless collapse [393], with well-defined
relationships between the mass, radius and velocity dispersion. Consequently, there would be fewer
mergers than in ΛCDM at intermediate redshifts, in accordance with constraints from interacting
galaxies (see Sect. 6.5.3), which could explain the observed abundance of large thin bulgeless disks
unaffected by major mergers (see Sect. 4.2), and in those rare mergers between large spirals, tidal
dwarf galaxies would be formed and survive more easily (see Sect. 6.5.4). This could lead to the 
intriguing possibility that most dwarf galaxies are not primordial but have been formed tidally in 
these encounters [240]. These populations of satellite galaxies, associated with globular clusters 
that formed along with them, would naturally appear in (more than one) closely related planes 
(because a gas-rich galaxy pair undergoes many close encounters in MOND before merging, see 
Sect. 6.5.3), thereby perhaps providing a natural solution to the Milky Way satellites phase-space 
correlation problem of Sect. 4.2. What is more, the density-morphology relation for dwarf ellipticals 
(more dE galaxies in denser environments [240]), observed in the field, in galaxy groups and in 
galaxy clusters could also find a natural explanation. 

Actually, the chief problem seems not to be forming structure in MOND, but the danger of
over-producing it [342, 399]. The amplitude of the power spectrum is well measured at $z = 1091$
in the CMB and at $z \approx 0$ by surveys like the Sloan Digital Sky Survey. Simulations normalized
to the CMB overproduce the structure at $z = 0$ by a factor of ~ 2. Given the uncertainty in
the parent relativistic theory and hence the appropriate form of the expansion history, this seems
remarkably close. Given the non-linear nature of the theory, MOND could easily have been wrong
by many orders of magnitude in this context. Nevertheless, it may be necessary to somehow
damp the growth of structure at late times [399]. In this regard, a laboratory measurement of
the ordinary neutrino mass might be relevant. Conventional structure cannot form in ΛCDM if
$m_\nu > 0.2$ eV [230]. In contrast, some modest damping from a non-trivial neutrino mass might be
desirable in MOND, and is also relevant to the CMB and clusters of galaxies (see Sect. 6.6.4).

In addition to mapping the growth factor as a function of redshift, one would also like to
predict the power spectrum of mass fluctuations as a function of scale at a given epoch. It is
certainly possible to match the power spectrum of galaxies at $z = 0$ [399], but because of MOND's
non-linearity and the uncertainty in the background cosmology, it is rather harder to know if such
a match faithfully represents a viable theory. Indeed, a natural prediction of baryon dominated
cosmologies is the presence of strong baryon acoustic oscillations in the matter power spectrum at
$z = 0$ [268, 128]. Dodelson [128] portrays this as a problem, but as already pointed out in [268],
the non-linearity of MOND can lead to mode mixing that washes out the initially strong signal by
$z = 0$. A more interesting test would be provided by the galaxy power spectrum at high redshift
($z \approx 5$). This is a challenging observation, as one needs both a large survey volume and high
resolution in $k$-space. The latter requirement arises because the predicted features in the power
spectrum are very sharp. The window functions necessarily employed in the analysis of large scale
structure data are typically wider than the predicted features. Convolution of the predicted power
spectrum with the SDSS analysis procedure [327] shows that essentially all the predicted features
wash out, with the possible exception of the strongest feature on the largest scale. This means
that the BAO signal detected by SDSS and consistent with ΛCDM [136] could also be interpreted
as a confirmation of the a priori prediction [268] of such features in MOND. However, there is no
definitive requirement that the BAO appears at the same scale as observed, or that it survives at
all. In relativistic theories such as TeVeS (Sects. 7.3 and 7.4), damping of the baryonic oscillations
can be taken care of by parameters of the theory such as $K$ in original TeVeS (Eq. 84, see figure 3
of [431]) or the $c_i$ coefficients in generalized TeVeS (Eq. 87). In any case, as in standard cosmology,
the angular power spectrum of the CMB should be a cleaner probe.

A first attempt to address the CMB was made before the existence of relativistic theories with
a simple ansatz [266]: just as MOND returns precisely Newton in high accelerations, so any parent
theory should contain GR (almost exactly, although this is not precisely the case for, e.g., TeVeS)
in the appropriate strong-field limit. An obvious first assumption is that MOND effects do not
yet appear in the very early Universe, so that pure GR suffices for calculations concerning the
CMB. The chief difference between ΛCDM and a MONDian cosmology is then just the presence or
absence of non-baryonic cold dark matter. With this ansatz, we can make one robust prediction:
the shape of the acoustic power spectrum should follow pure baryonic diffusion damping. There is
no net forcing term, as provided by the extra degree of freedom of non-baryonic cold dark matter.
With nothing but baryons, each acoustic peak should thus be lower than the previous one [426] as
part of a simple damping tail (Figure 45). In contrast, there must be evidence of forcing present
in a power spectrum where CDM outweighs the baryons.

Footnote: Even if BAO features are present at high redshift in MOND, it is not clear that low redshift structures will
correlate with the ISW in the CMB as they should in conventional cosmology, because of the late-time non-linearity
of MOND.

Figure 45: The acoustic power spectrum of the cosmic microwave background as observed by
WMAP [230] together with the a priori predictions of ΛCDM (red line) and no-CDM (blue line)
as they existed in 1999 [266] prior to observation of the acoustic peaks. ΛCDM correctly predicted
the position of the first peak (the geometry is very nearly flat) but over-predicted the amplitude
of both the second and third peak. The most favorable a priori case is shown; other plausible
ΛCDM parameters [469] predicted an even larger second peak. The most important parameter
adjustment necessary to obtain an a posteriori fit is an increase in the baryon density $\Omega_b$ above
what had previously been expected from big bang nucleosynthesis. In contrast, the no-CDM model
ansatz made as a proxy for MOND successfully predicted the correct amplitude ratio of the first
to second peak with no parameter adjustment [269, 270]. The no-CDM model was subsequently
shown to under-predict the amplitude of the third peak [443].

The density of both the baryons and the non-baryonic cold dark matter are both critical to the 
shape of the acoustic power spectrum. For a given baryon density, models with CDM will have a
larger second peak than models without it. Similarly, the third peak is always lower than the second
in purely baryonic models, while it can be either higher or lower in CDM models, depending on the
mix of each type of mass. Moreover, both parameters were well constrained prior to observation of
the CMB [469]: $\Omega_b$ from BBN [481] and $\Omega_m$ from a variety of methods [117]. It therefore seemed like
a straightforward exercise to predict the difference one should observe. The most robust prediction
that could be made was the ratio of the amplitude of the first to second acoustic peak [266]. For
the range of baryon and dark matter densities allowed at the time, ΛCDM predicted a range in
this ratio anywhere from 1.5 to 1.9. That is, the first peak should be almost but not quite twice 
as large as the second, with the precise value containing the information necessary to much better 
constrain both density parameters. For the same baryon densities allowed by BBN but no dark 
matter, the models fell in a distinct and much narrower range: 2.2 to 2.6, with the most plausible 
value being 2.4. The second peak is smaller (so the ratio of first to second higher) because there 
is no driving term to counteract baryonic damping. In this limit, the small range of relative peak 
heights follows directly from the narrow range in $\Omega_b$ from BBN.

The BOOMERanG experiment [118] provided the first data capable of testing this prediction,
and was in good agreement with the no-CDM prediction [269]. This result was subsequently
confirmed by WMAP, which measured a ratio 2.34 ± 0.09 [346]. This is in good quantitative
agreement with the a priori prediction of the no-CDM ansatz, and outside the range first expected
in ΛCDM. ΛCDM can nevertheless provide a good fit to the CMB power spectrum. The chief
parameter adjustment required to obtain a fit is the baryon density, which must be increased: this
is the reason for the near doubling of the long-standing value $\Omega_b h^2 = 0.0125$ [481] to the more
recent $\Omega_b h^2 = 0.02249$ [230].
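
The comparison described above is simple enough to check by hand, but a short script makes the logic explicit. The sketch below uses only the numbers quoted in the text (the two a priori ranges for the first-to-second peak ratio, the WMAP measurement, and the two baryon densities); the function name `overlaps` and the choice of testing at 1σ and 3σ are assumptions made for illustration, not part of any published analysis.

```python
# A priori ranges for the first-to-second acoustic peak amplitude ratio quoted in the text.
LCDM_RANGE = (1.5, 1.9)      # LambdaCDM, for the densities allowed at the time
NO_CDM_RANGE = (2.2, 2.6)    # no-CDM ansatz, same BBN baryon densities

# WMAP measurement of the ratio quoted in the text.
WMAP_RATIO, WMAP_SIGMA = 2.34, 0.09

# Baryon densities Omega_b h^2 quoted in the text.
OMEGA_B_H2_BBN = 0.0125      # long-standing pre-CMB value
OMEGA_B_H2_CMB = 0.02249     # value from LambdaCDM fits to the CMB


def overlaps(value, sigma, bounds, n_sigma=1.0):
    """True if [value - n_sigma*sigma, value + n_sigma*sigma] overlaps the interval `bounds`."""
    lo, hi = bounds
    return (value + n_sigma * sigma) >= lo and (value - n_sigma * sigma) <= hi


if __name__ == "__main__":
    for n in (1.0, 3.0):
        print(f"{n:.0f} sigma: overlaps no-CDM range: "
              f"{overlaps(WMAP_RATIO, WMAP_SIGMA, NO_CDM_RANGE, n)}, "
              f"overlaps LambdaCDM range: "
              f"{overlaps(WMAP_RATIO, WMAP_SIGMA, LCDM_RANGE, n)}")
    print(f"Increase in Omega_b h^2: x{OMEGA_B_H2_CMB / OMEGA_B_H2_BBN:.2f}")
```

With these inputs the measured ratio lies inside the no-CDM range and stays above the upper edge of the a priori ΛCDM range even at 3σ, while the baryon density increases by a factor of about 1.8, which is the "near doubling" referred to above.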

A critical question is whether the baryon density required by ΛCDM is consistent with the
independently measured abundances of the light isotopes. This question is explored in Figure 46.
Historically, no isotope suggested a value $\Omega_b h^2 > 0.02$ prior to fits to the CMB requiring such a high
value. This is an important fact to bear in mind, since historically cosmology has a long tradition
of confirmation bias. More recent measurements of deuterium and helium are consistent with
the high baryon density required by ΛCDM fits to the CMB. Lithium persistently suggests a lower
baryon density, consistent with pre-CMB values. If we are convinced of the correctness of ΛCDM,
then it is easy to dismiss this as some peculiarity of stars: if exposed to the high temperatures
in the cores of stars by turbulent mixing, lithium might be depleted from its primordial value. If
we are skeptical of ΛCDM, then it is no surprise that measurements of the primordial lithium
abundance return the same value now as they did before. From the perspective of the no-dark 
matter MOND view, the CMB, lithium, deuterium, and helium all give a consistent baryon density 
given the uncertainties. 

However, the no-CDM ansatz must fail at some point. It could fail outright if the parent MOND
theory deviates substantially from GR in the early Universe. However, the more obvious [266]
points of failure are rather due to the anticipated early structure formation in MOND discussed
above. This should lead, in a true MOND theory, to early re-ionization of the Universe and an
enhancement of the integrated Sachs-Wolfe effect. Evidence for both these effects is present in
the WMAP data [270]. Indeed, it turns out to be rather easy, and perhaps too easy, to enhance
the integrated Sachs-Wolfe (ISW) effect in theories like TeVeS or GEA [IsT I ISTT]. Nevertheless,
early re-ionization is an especially natural consequence of MOND structure formation that was
predicted a priori [266]. In contrast, structure is expected to build up more slowly in ΛCDM,
such that obtaining the observed early re-ionization implies that the earliest objects to collapse
were ~ 50 times as efficient at converting mass to ionizing photons as are collapsed objects at the
present time [436].

Footnote: Perhaps the most famous modern example of confirmation bias is in measurements of the Hubble constant [467],
where over many years de Vaucouleurs persistently found $H_0 \approx 100\ \mathrm{km\,s^{-1}\,Mpc^{-1}}$ while Sandage persistently found
$H_0 \approx 50\ \mathrm{km\,s^{-1}\,Mpc^{-1}}$. Then, as now, there was a conflation of data with theory: the lower value of $H_0$ was more
widely accepted because it was required for cosmology to be consistent with the ages of the oldest stars.

Figure 46: Estimates of the baryon density $\Omega_b h^2$ [where $h = H_0/(100\ \mathrm{km\,s^{-1}\,Mpc^{-1}})$] over time
(updated [274] from [270]). Big bang nucleosynthesis was already a well established field prior to
1995; earlier contributions are summarized by compilations (green ovals [481, 108]) that gave the
long-lived standard value $\Omega_b h^2 = 0.0125$ [481]. More recent estimates from individual isotopes
are shown as triangles ($^2$H), squares ($^4$He), diamonds ($^3$He), and stars ($^7$Li). Estimates of the
baryon density based on analyses of the cosmic microwave background are shown by circles (dark
blue for ΛCDM; light blue for no-CDM). No measurement of any isotope suggested a value greater
than $\Omega_b h^2 = 0.02$ prior to observation of the acoustic peaks in the microwave background (dotted
lines), which might be seen as a possible illustration of confirmation bias. Fitting the acoustic
peaks in ΛCDM requires $\Omega_b h^2 > 0.02$. More recent measurements of $^2$H and $^4$He have migrated
towards the ΛCDM CMB value, while $^7$Li remains persistently problematic [112]. It has been
suggested that turbulent mixing might result in the depletion of primordial lithium necessary to
reconcile lithium with the CMB (upward pointing arrow [288]) while others [406] argue that this
would merely reconcile some discrepant stars with the bulk of the data defining the Spite plateau,
which persists in giving a $^7$Li abundance discrepant from the ΛCDM CMB value. In contrast,
the amplitude of the second peak of the microwave background is consistent with no-CDM and
$\Omega_b h^2 = 0.014 \pm 0.005$ [270]. Consequently, from the perspective of MOND, the CMB, lithium,
deuterium, and helium all give a consistent baryon density given the uncertainties.

One prediction of the no-CDM ansatz that should not obviously fail is that the third peak
should be smaller than the second peak of the acoustic power spectrum of the CMB. In a Universe
governed by MOND rather than cold dark matter, there is a priori no obvious non-baryonic mass
that is decoupled from the photon-baryon fluid. It is therefore a strong expectation that we observe
only baryonic damping in the power spectrum, and each peak should be smaller in amplitude than
the previous one. Contrary to this expectation, WMAP observes the third peak to be nearly equal
in amplitude to the second [443, 230]. This approximate equality of the second and third peaks
falsifies the simple no-CDM ansatz.
Figure 47: CMB data as measured by the WMAP satellite year five data release (filled circles)
and the ACBAR 2008 data release (triangles). Dashed line: ΛCDM fit. Solid line: HDM fit with
a sterile neutrino of mass 11 eV (Figure courtesy of G. Angus).
The PLANCK mission should soon report a new and much higher resolution measurement of
the CMB acoustic power spectrum. It is conceivable that improved data will reveal a different
power spectrum. A third peak as low as that expected in the no-CDM ansatz would be one of
the few observations capable of clearly falsifying the existence of cosmic non-baryonic dark matter.
A more likely result is basic confirmation of existing observations with only minor tweaks to the
exact power spectrum. Such a result would have little impact on the discussion here as it would
simply confirm the need for some degrees of freedom in relativistic MOND theories that can play
a role analogous to CDM. However, the uncertainties on the best fit cosmological parameters may
become negligibly small. Precise as current data are, cosmology (with the exception of BBN) is
still far from being over-constrained. Hopefully PLANCK data will be sufficiently accurate that
they either agree or clearly do not agree with a host of other observations.

Presuming nothing substantial changes in the CMB data, we must understand the net forcing 
term in the acoustic oscillations leading to a high third peak. This might be taken in one of three 
ways: 

(i) Practical falsification of MOND, 

(ii) Proof of the existence of some form of non-baryonic matter particles, 

(iii) An indication of some necessary additional freedom in relativistic parent theories of MOND,
playing the role of the non-baryonic mass in the CMB.

Tempting as the first case (i) is [433], we cannot know whether the CMB falsifies MOND until we
have exhaustively explored the predictions of relativistic parent theories (Sect. 7). The possibility
of true non-baryonic mass (ii) a priori seems inelegant, although a modification of gravity and
the existence of non-baryonic dark matter are not mutually exclusive concepts. What is more,
there is one obviously existing form of non-baryonic mass that may be relevant on cosmic scales:
neutrinos. If $m_\nu \sim \sqrt{\Delta m^2}$ [435], then the neutrino mass is too small to be of interest in this
context. However, as discussed above, a modest neutrino mass may help to prevent MOND from
over-predicting the growth of structure. Independently, a mass of $\approx$ 1 eV to 2 eV for the three
neutrino species provides a good match to the width of the acoustic peaks of the CMB [210], which
are otherwise too wide in a purely baryonic Universe. Note that it provides as well a match to
the missing mass in galaxy clusters of $T > 4$ keV (see Sect. 6.6.4). However, this neutrino mass
is inadequate to explain the relatively high third peak in the no-DM ansatz. Obtaining a match
to that rather requires a neutrino mass (for only one species) of ~ 10 eV [S]. Such a large mass
violates experimental constraints on the ordinary neutrino mass [235], but it may be possible to
have a sterile neutrino with a mass in that ballpark [27]. As strange as this sounds, it provides a
good fit to the CMB (Figure 47), and it may provide the unseen mass in all clusters and groups
(see Sect. 6.6.4 [13] [H]). Experiments that can address the existence of such a particle would

At $\ell \approx 800$, the third peak is only marginally resolved by WMAP. This scale is comparable to a single 
(frequency dependent) beam size, and as such is extraordinarily sensitive to corrections for the instrumental point 
spread function [405]. 

Determining agreement between independent observations requires that we believe not just the result (e.g., the 
value of $H_0$ from direct distance measurements) but also its uncertainty. The latter has always been challenging in 
astronomy, and the history of cosmology is replete with examples of results that were simply wrong. While we may 
have entered the era of precision cosmology, we have yet to reach an era when data are so accurate that we can 
hope to challenge cosmology with falsification if, for example, PLANCK data require $H_0 < 60\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$ while 
galaxy distances require $H_0 > 70\,\mathrm{km\,s^{-1}\,Mpc^{-1}}$. 

The third possibility actually means either non-local effects in non-local theories (Sect. 7.10), or the effect of 
additional fields in local modified gravity theories. The important difference with CDM is that these fields are not 
simply representative of collisionless massive particles, that their behaviour is determined by the baryons in static 
configurations, and that they can be subdominant to the baryonic density. In theories where their energy density 
dominates that of baryons, these new fields then really act as dark matter in the early Universe, which is also a 
possibility (see Sects. 7.6 and 7.9). 
thus be very interesting [289], although in the meantime it is perhaps best to view it merely as the 
encapsulation of our ignorance about cosmology in modified gravity theories, much as dark energy 
currently plays the same role in conventional cosmology. The fit of Figure [47] is at least a proof 
of concept that cold DM is definitely not required by the CMB alone. 

Perhaps the most intriguing possibility is (iii), that the height of the third peak is providing a 
glimpse of some new aspect of modified gravity theories. As we have seen, generalizations of GR 
seeking to incorporate MONDian phenomenology must, perforce, introduce either non-locality 
(Sect. 7.10), or new degrees of freedom in local theories. It is at least conceivable that these 
new degrees of freedom result in the net driving of the acoustic oscillations that is implied by 
the departure from pure baryonic damping. For instance, Dodelson & Liguori [129] have shown 
that in TeVeS (Sects. 7.3 and 7.4) or GEA (Sect. 7.7) theories, based on unit-norm vector fields, 
the growth of the spatial part of the vector perturbation in the course of cosmological evolution 
acts as an additional seed akin to non-baryonic dark matter (but unlike dark matter, its energy 
density is subdominant to the baryonic mass). It has been shown that, with the help of 
this effect prior to baryon-photon decoupling, it is possible to produce a third 
peak as high as the second one in TeVeS and GEA theories without non-baryonic dark matter, but at the 
cost of unacceptably high temperature anisotropies in the CMB on large angular scales, 
due to an over-enhanced ISW effect [431, 517]. Indeed, when making the effect of the growth of 
the perturbed vector modes large, one also generates [152, 410, 499] a large gravitational slip (see 
Sect. 8.4) in the perturbed FLRW metric (Eq. [116]), which in turn leads to enhanced ISW. For 
this reason, acceptable fits to the CMB in TeVeS or GEA still need to appeal to non-baryonic 
mass [431]. In this case, ordinary neutrinos within their model-independent mass-limit [235] are 
sufficient, though. The gravitational slip could, however, soon make it possible to exclude at least some of 
these models from combined information on the matter overdensity and weak lensing [363, 499]. 
However, an important caveat is that all of the above arguments are based on adiabatic initial 
conditions. While initial isocurvature perturbations are basically ruled out in the GR context, 
this is not necessarily true for modified gravity theories, so that correlated mixtures of adiabatic 
and isocurvature modes could perhaps lower the ISW effect and/or raise the third peak [430]. 

Of course, when the additional "dark fields" of relativistic MOND theories are truly massive (as 
is the case in some theories), they can be thought of as true "dark matter", whose energy density 
outweighs the baryonic one in the early Universe: this is the case for the second scalar field of 
BSTV (Sect. 7.5), the scalar field of Sect. 7.6, and of course the dipolar dark matter of Sect. 7.9. In 
all these cases, reproducing the acoustic peaks of the CMB is, by construction, not a problem at 
all (nor is erasing the baryon acoustic oscillations in the matter power spectrum, contrary to [128]), 
while the MOND phenomenology is still nicely recovered in galaxies. In the case of BIMOND 
(Sect. 7.8), the possible appeal to twin matter could also have important consequences on the 
growth of structure [317] and of course on the CMB acoustic peaks too, although the latter analysis 
is still lacking. In an initially matter-twin matter symmetric Universe, if the initial quantum 
fluctuations are not identical in the two sectors, matter and twin matter would still segregate 
efficiently, since density differences grow much faster than the sum [317]. The inhomogeneities of 
the two matter types would then develop, eventually, into mutually avoiding cosmic webs, and the 
tensors coming from the variation of the interaction term between the two metrics with respect 
to the matter metric can then act precisely as the energy-momentum tensor of cosmological dark 

In TeVeS, the perturbations of the scalar field also play an important role in generating enhanced growth [147]. 
P. Ferreira's talk, Alternative Gravities and Dark Matter Workshop, Edinburgh, April 2006. 
The ISW effect can be cast as the integral of $-(\Phi - \Psi)' + 2\Phi'$, thus involving both a gravitational slip part 
and a growth rate part. 

Note that the presence of non-baryonic matter in the form of massive neutrinos also helps to damp the baryonic 
acoustic oscillations [128] at $z = 0$, as can be seen in Figure 4 of [431]. 

This means that, on surfaces of constant temperature, the densities of the various components (e.g. baryons, 
neutrinos, additional dark fields) are uniform, and that these components share a common velocity field. 
matter [317], besides its contribution to the cosmological constant (see Sect. 9.1). Finally, the most 
thought-provoking and interesting possibility would perhaps be to explain all these cosmological 
observations through non-local effects (Sect. 7.10). In any case, it is likely that MOND will not be 
making truly clear predictions regarding cosmology until a more profound theory, based on first 
principles and underlying the MOND paradigm, is found. 
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10 Summary and Discussion 



In this review, after briefly presenting the currently favored ΛCDM model of cosmology (which 
clearly works overwhelmingly well on large scales despite its slightly inelegant mixture of currently 
unknown elements, Sects. 2 and 3), we reviewed the few most outstanding challenges that this model 
is still facing (Sect. 4), which will have to be addressed one way or the other in the coming years. 
These include coincidences at $z = 0$ between the scale of the energy density in dark energy, dark 
matter, and baryonic matter, as well as a common natural scale for the behavior of the dark matter 
and dark energy sectors. What is more, as far as galaxy formation is concerned, many predictions 
made by the model (keeping in mind that baryon physics could modify these predictions) were 
ruled out by observations: these include many observations indicating that structure formation 
should take place earlier than predicted, the low number of observed satellites around the Milky 
Way (especially the missing satellites at the low and high mass ends of the mass function), the 
phase-space correlation of satellite galaxies of the Milky Way as opposed to their predicted isotropic 
distribution, the apparent presence of constant DM density cores in the central parts of galaxies 
instead of the predicted cuspy dark halos, the over-abundance of large bulgeless thin disk galaxies 
that are extremely difficult to produce in simulations, or the presence of spiral arms in disks that 
should be immune to such instabilities. But even more challenging is the appearance (Figure [48]) 
of an acceleration constant $a_0 \sim 10^{-10}\,\mathrm{m\,s^{-2}}$ (i.e., the common scale of the dark matter and dark 
energy sectors, as $a_0 \sim \Lambda^{1/2}$ in natural units) in many a priori unrelated scaling relations for DM 
and baryons in galaxies. These scaling relations involve a possibly devastating amount of fine-tuning 
for all collisionless dark matter models (Sect. 4.3), and can all be summarized by Milgrom's 
empirical formula (Sect. 5), meaning that the observed gravitational field in galaxies is mimicking 
a universal force law generated by the baryons alone. 
Figure 48: The acceleration parameter $\sim V_f^4/(G M_b)$ of extragalactic systems, spanning ten decades 
in baryonic mass $M_b$. X-ray emitting galaxy groups and clusters are visibly offset from smaller 
systems, but by a remarkably modest amount over such a long baseline. The characteristic acceleration 
scale $a_0 \approx \sqrt{\Lambda}$ is in the data, irrespective of the interpretation. And it actually plays 
various other independent roles in observed galaxy phenomenology. This is natural in MOND (see 
Sect. 5.2), but not in ΛCDM (see Sect. 4.3). 
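
The numerical coincidence between the acceleration constant and the square root of the cosmological constant can be checked with a few lines of arithmetic. The following is only a minimal sketch: restoring the factors of c that are set to unity in natural units gives the acceleration $c^2\sqrt{\Lambda}$, and the specific values adopted for the acceleration constant ($1.2 \times 10^{-10}\,\mathrm{m\,s^{-2}}$) and for $\Lambda$ ($1.1 \times 10^{-52}\,\mathrm{m^{-2}}$) are assumptions of this illustration rather than numbers quoted in the text. The function names are hypothetical.

```python
import math

C = 2.998e8        # speed of light in m/s
A0 = 1.2e-10       # ASSUMED value of the MOND acceleration constant, m/s^2
LAMBDA = 1.1e-52   # ASSUMED value of the cosmological constant, m^-2


def lambda_acceleration(lam=LAMBDA):
    """Acceleration scale c^2 * sqrt(Lambda), in m/s^2."""
    return C ** 2 * math.sqrt(lam)


def ratio_to_a0(lam=LAMBDA, a0=A0):
    """Ratio of the Lambda-based acceleration scale to a0 (dimensionless)."""
    return lambda_acceleration(lam) / a0


if __name__ == "__main__":
    print(f"c^2 sqrt(Lambda) = {lambda_acceleration():.2e} m/s^2")
    print(f"ratio to a0      = {ratio_to_a0():.1f}")
```

With these assumed inputs the two scales agree to within one order of magnitude, which is all that the statement "of the order of" claims.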

With inert, collisionless and dissipationless DM, making Milgrom's law emerge requires a huge, 
and perhaps even unreasonable, amount of fine-tuning in the expected feedback from the baryons. 
Indeed, the relation between the distribution of baryons and DM should a priori depend on 
the various different histories of formation, intrinsic evolution, and interaction with the environment 
of the various different galaxies, whereas Milgrom's law provides a successful, unique and 
history-independent relation. Given this puzzle, the central idea of Modified Newtonian Dynamics 
(MOND) is rather to explore the possibility that the force law is indeed effectively modified 
(Sect. 6). The main motivation for studying MOND is thus a fully empiricist one, as it is driven by 
the observed phenomenology on galaxy scales, and not by an aesthetic wish of getting rid of DM. 
The corollary is that it is a priori not a problem for a theory designed to reproduce the uncanny 
successes of the MOND phenomenology to replace CDM by "dark fields" (see Sect. 7) or more exotic 
forms of DM, different from simple collisionless DM particles, contrary to the common belief 
that this would be against the spirit of the MOND paradigm (although it is true that it would 
be more elegant to avoid too many additional degrees of freedom). It is perhaps more important 
that, if MOND is correct in the sense of the acceleration $a_0$ being a truly fundamental quantity, 
the strong equivalence principle cannot hold anymore, and local Lorentz invariance could perhaps 
be spontaneously violated too. 

At this juncture, it is worthwhile to summarize the general predictions of MOND, as a paradigm, 
and their observational tests (Table [2]). As a mathematical description of the effective force law, 
MOND works remarkably well in individual galaxies. As a modified gravity theory (at the classical 
level) , it makes some predictions that are both unique and challenging to reproduce in the context 
of the ΛCDM paradigm. However, MOND faces sharp challenges, particularly with cosmology and 
in rich clusters of galaxies, which will not be conclusively addressed without a viable parent theory 
(Sect. 7), based on first principles and underlying the MOND paradigm (if such a theory exists at 
all). In any case, in his series of papers introducing the idea in 1983, Milgrom [295] made a few 
very explicit predictions, which we quote hereafter, and compare with modern observational data 
(see also the Kepler-like laws of galactic dynamics in Sect. 5.2): 

• "Velocity curves calculated with the modified dynamics on the basis of the observed mass in 
galaxies should agree with the observed curves. " 

It is now well established that MOND provides good fits to the rotation curves of galaxies 
(Figure [23] [399, 167]), including bumps and wiggles associated with a baryonic counterpart 
(Figure [m Kepler-like law n°10 in Sect. 5.2). These fits are obtained with a single free 
parameter per galaxy, the mass-to-light ratio of the stars. What makes them most impressive 
is that the best-fit mass-to-light ratios, obtained on purely dynamical grounds assuming 
MOND, vary with galaxy color exactly as one would expect from stellar population synthesis 
models [43], which are based on astronomers' detailed understanding of stars. Note that the 
rotation curves of galaxies are predicted to be asymptotically flat, even though this flatness 
is not always attained at the last observed point (see Kepler-like law n°1 in Sect. 5.2, and 
last explicit prediction hereafter). 

• "The relation between the asymptotic velocity and the mass of the galaxy is an absolute one. " 

This is the Baryonic Tully-Fisher relation, with $V_f^4 = a_0 G M_b$ (see Kepler-like law n°2 
in Sect. 5.2). It appears to hold quite generally [273], even for galaxies that we would 
conventionally expect to deviate from it [166, 279, 278]. 

• "Analysis of the z-dynamics in disk galaxies using the modified dynamics should yield surface 
densities which agree with the observed ones. " 

This states that in addition to the radial force giving the rotation curve, the motions of 
stars perpendicular to the disk must also follow from the source baryons (see Sect. 6.5.3). 
This proves to be a remarkably challenging observation, and such data for external galaxies 
are costly to obtain [45]. To make matters still more difficult, the radial acceleration usually 
dominates the vertical one ($V^2/r \gg \sigma_z^2/z$). This has the consequence that the distinction between 
MOND and conventional dynamics is not pronounced in regions that are well observed, 
becoming pronounced only at rather low baryonic surface densities [279]. The vertical velocity 
dispersion in low surface density regions (see Sect. 6.5.3) is typically ~ 8 km/s [26 l 1242]. 
This exceeds the nominal Newtonian expectation (typically ~ 2 km/s for $\Sigma = 1\,M_\odot\,\mathrm{pc}^{-2}$, 
depending on the thickness of the disk), and is more in accordance with MOND. However, it 
would require a considerably more detailed analysis to consider this a test, let alone a success, 
of MOND. The Milky Way (Sect. 6.5.2) may provide an excellent test for this prediction [5T1 
1379) as more precise data become available. 

• "Effects of the modification are predicted to be particularly strong in (low surface brightness) 
dwarf galaxies. " 

The dwarf spheroidal satellite galaxies of the Milky Way have very low surface densities of 
stars, so (see Kepler-like law n°8 in Sect. 5.2) are far into the MOND regime. As expected, 
these systems exhibit large mass discrepancies [478, 428]. Detailed fits to the better observed 
"classical" dwarfs [8] are satisfactory in most cases (see Sect. 6.6.2). The so-called "ultrafaint" 
dwarfs appear more problematic [285], in the sense that their velocity dispersions are higher 
than expected. This might be an indication of the MOND-specific external field effect (see 
Sect. 6.3 and [79]), as the field of the Milky Way dominates the internal fields of the ultrafaint 
dwarfs. If so, these objects are not in dynamical equilibrium, which considerably complicates 
their analysis. 

• Locally measured mass-to-light ratios should show no indication of hidden mass when $V^2/R \gg a_0$, 
but rise beyond the radius where $V^2/R \approx a_0$. 

We have paraphrased this prediction for brevity (see also Kepler-like law n°7 in Sect. 5.2). 
The test of this prediction is shown in Figures [10] HU andlMl. The predicted effect is obvious in 
the data with population synthesis mass-to-light ratios for the stars [43], or with dynamical 
mass-to-light ratios [279] that make no assumption about stellar mass. In HSB spirals, there 
is no obvious need for dark matter in the inner regions, with the mass discrepancy only 
appearing at large radii as the acceleration drops below $a_0$ (Figure [10]). 

• "Disk galaxies with low surface brightness provide particularly strong tests. " 

Low surface brightness means low stellar surface density, which in turn means low acceleration. 
LSB galaxies are thus predicted to be well into the modified regime (see also Kepler-like 
law n°8 in Sect. 5.2). This was a strong a priori prediction, because few bona fide examples 
of such objects were known at the time. Indeed, in 1983, when these predictions were 
published, it was widely thought that nearly all disk galaxies shared a common high surface 
brightness. One specific consequence of MOND for LSB galaxies is that they should lie on 
the same BTFR, with the same normalization, as high surface brightness spirals. This was 
subsequently observed to be the case [518, 444]. There is no systematic deviation from the 
BTFR with surface brightness (Figure [5]), contrary to what is naturally expected in 
conventional dynamics [279, TlOj. Another consequence of low surface density is that the 
acceleration is low ($< a_0$) everywhere. As a result, the mass discrepancy appears at a smaller 
radius in low surface brightness galaxies, and is larger in amplitude than in high surface 
brightness galaxies. This effect was subsequently observed (Figure [TJ] [279]). 

• "We predict a correlation between the value of the average surface density of a galaxy and 
the steepness with which the rotational velocity rises to its asymptotic value. " 

MOND does not simply make rotation curves flat. It predicts that high surface brightness 
galaxies have rotation curves that rise rapidly before becoming flat, and may even fall towards 
asymptotic flatness. In contrast, low surface brightness galaxies should have slowly rising 
rotation curves that only gradually approach asymptotic flatness (see also Kepler-like law 
n°8 in Sect. 5.2). Both morphologies are observed (Figure ITS]). The expected connection 
between dynamical acceleration and the surface density of the source baryons is illustrated 
in Figures [HI and [TCI. 
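
The "absolute" relation between asymptotic velocity and baryonic mass quoted in the list above lends itself to a short numerical illustration. The sketch below assumes the deep-MOND form $V_f^4 = a_0 G M_b$ and an adopted value $a_0 = 1.2 \times 10^{-10}\,\mathrm{m\,s^{-2}}$; that value, and the function names, are assumptions of this example and not taken from the text.

```python
G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30   # solar mass, kg
A0 = 1.2e-10       # ASSUMED value of the MOND acceleration constant, m/s^2


def flat_velocity_kms(m_b_msun, a0=A0):
    """Asymptotic rotation velocity (km/s) from V_f^4 = a0 * G * M_b."""
    m_b_kg = m_b_msun * M_SUN
    return (a0 * G * m_b_kg) ** 0.25 / 1.0e3


def baryonic_mass_from_velocity(v_flat_kms, a0=A0):
    """Baryonic mass (solar masses) implied by a given flat velocity."""
    v_si = v_flat_kms * 1.0e3
    return v_si ** 4 / (a0 * G) / M_SUN


if __name__ == "__main__":
    for mass in (1e8, 1e10, 1e11):
        print(f"M_b = {mass:.0e} Msun -> V_f = {flat_velocity_kms(mass):.0f} km/s")
```

Because the mass enters only through a fourth root, a factor of 16 in baryonic mass changes the flat velocity by only a factor of 2, and no surface-brightness term appears anywhere in the relation.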
The original predictions listed above cover many situations, but not all. Indeed, once one 
writes a specific force law, its application must be completely general. Such a hypothesis is readily 
subject to falsification, provided sufficiently accurate data are available to test it - a perpetual challenge for 
astronomy. Table [2] summarizes the tests discussed here. By and large, tests of MOND involving 
rotationally supported disk galaxies are quite positive, as largely detailed above (see Sect. 6.5). By 
construction, there is no cusp problem (solution to the challenge n°6 of Sect. 4.2), and no missing 
baryons problem (solution to the challenge n°10 of Sect. 4.2), as the way the dynamical mass-to-light 
ratio systematically varies with the circular velocity is a direct consequence of Milgrom's law 
(Kepler-like law n°4 of Sect. 5.2). There does appear to be a relation between the quality of the 
data and the ease with which a MOND fit to the rotation curve is obtained, in the sense that fits are 
most readily obtained with the best data [29]. As the quality of the data declines [384], one begins to 
notice small disparities. These are sometimes attributable to external disturbances that invalidate 
the assumption of equilibrium [40 1). For targets that are intrinsically difficult to observe, minor 
problems become more common [121, 449]. These typically have to do with the challenges inherent 
in combining disparate astronomical data sets (e.g., rotation curves measured independently at 
optical and radio wavelengths) and constraining the inclinations of low surface brightness galaxies 
(bear in mind that all velocities require a sin(i) correction to project the observed velocity into the 
plane of the disk, and mass in MOND scales as the fourth power of velocity). Given the intrinsic 
difficulties of astronomical observations, it is remarkable that the success rate of MOND fits is 
as high as it is: of the 78 galaxies that have been studied in detail (see Sect. 6.5.1), only a few 
cases (most notably NGC 3198 [69, 167]) appear to pose challenges. Given the predictive and 
quantitative success of the majority of the fits, it would seem unwise to ignore the forest and focus 
only on the outlying trees. 
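
The sensitivity to inclination mentioned above can be made concrete. Since the deprojected velocity is the observed one divided by sin(i), and the MOND mass scales as the fourth power of velocity, an error in the adopted inclination is amplified strongly for nearly face-on disks. The following is a minimal sketch of that arithmetic only; the function names and the example angles are hypothetical choices for illustration.

```python
import math


def deprojected_velocity(v_los_kms, inclination_deg):
    """Rotation velocity in the disk plane from the line-of-sight velocity."""
    return v_los_kms / math.sin(math.radians(inclination_deg))


def mond_mass_bias(true_incl_deg, assumed_incl_deg):
    """Factor by which the inferred MOND mass (proportional to V^4) is
    mis-estimated when assumed_incl_deg is used instead of true_incl_deg."""
    ratio = (math.sin(math.radians(true_incl_deg))
             / math.sin(math.radians(assumed_incl_deg)))
    return ratio ** 4


if __name__ == "__main__":
    # The same 5 degree error at two different true inclinations.
    print(f"i = 30 deg taken as 25 deg: mass x {mond_mass_bias(30, 25):.2f}")
    print(f"i = 80 deg taken as 75 deg: mass x {mond_mass_bias(80, 75):.2f}")
```

In this illustration a 5 degree error roughly doubles the inferred mass for a disk inclined at 30 degrees, while it changes the mass by less than 10 percent for a disk inclined at 80 degrees.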

One rotationally supported system that is very familiar to us is the Solar system (see Sect. 6.4). 
The Solar system is many orders of magnitude removed from the MOND regime (Figure fTlT), so no 
strong effects are predicted. However, it is of course possible to obtain exquisitely precise data in the 
Solar system, so it is conceivable that some subtle effect may be observable [391]. Indeed, the lack 
of such effects on the inner planets already appears to exclude some slowly varying interpolation 
functions [53]. Other tests may yet prove possible [38, 315], but, as they are strong-field gravity 
tests by nature, they all depend strongly on the parent relativistic theory (Sect. 7) and how it 
converges towards GR [22]. So in Table [2] we list the status of Solar system tests as unclear, 
depending on the parent relativistic theory. 

An important aspect of galactic disks is their stability (see Sect. 6.5.3). Indeed, the need to 
stabilize disks was one of the early motivations for invoking dark matter [344]. MOND appears 
able to provide the requisite stability [78]. Indeed, it gives good reason [300] for the observed 
maximum in the distribution of disk galaxy surface densities at $\sim \Sigma_\dagger = a_0/G$ (Freeman's limit: 
Figure [8] and Kepler-like law n°6 in Sect. 5.2). Disks with surface densities below this threshold 
are in the low acceleration limit and can be stabilized by MOND. Higher surface density disks 
would be purely in the Newtonian regime and subject to the usual instabilities. Going beyond 
the amount of stability required for existence, another positive aspect of MOND is that it does 
not over-stabilize disks. Features like bars and spiral arms are a natural result of disk self-gravity. 
Conventionally, large halo-to-disk mass ratios suppress the growth of such features, especially in 
low surface brightness galaxies [292]. Yet such features are present. The suppression is not as 
great in MOND [78], and numerical simulations appear to do a good job of reproducing the range of 
observed morphologies of spiral galaxies (solution to the challenge n°9 of Sect. 4.2, see [459]). Bars 
tend to appear more quickly and are fast, while warps can also be naturally produced (Sect. 6.5.3). 
There appears to be no reason why this should not extend to thin and bulgeless disks, whose 
ubiquity poses a challenge to galaxy formation models in ΛCDM. This particular point of creating 

[279] utilized this fact to predict that conventional analyses of low surface brightness disks would infer abnormally 
high mass-to-light ratios for their stellar populations - a prediction that was subsequently confirmed [160, 372]. 
large bulgeless disks (challenge n°8 of Sect. 4.2) can actually be solved thanks to early structure 
formation followed by a low galaxy-interaction rate in MONDian cosmology (see Sect. 9.2), but 
this definitely warrants further investigation, so we mark this case as merely promising in Table [2]. 
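
The characteristic surface density $\Sigma_\dagger = a_0/G$ that sets Freeman's limit in the stability argument above can be converted into the units usually quoted for disks. This is only a rough sketch: the adopted $a_0 = 1.2 \times 10^{-10}\,\mathrm{m\,s^{-2}}$ is an assumption of the example, the function names are hypothetical, and the simple threshold test stands in for what is in reality a gradual transition between regimes.

```python
G = 6.674e-11       # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30    # solar mass, kg
PARSEC = 3.0857e16  # parsec, m
A0 = 1.2e-10        # ASSUMED value of the MOND acceleration constant, m/s^2


def critical_surface_density(a0=A0):
    """Sigma_dagger = a0 / G, converted from kg/m^2 to Msun/pc^2."""
    sigma_si = a0 / G
    return sigma_si * PARSEC ** 2 / M_SUN


def is_low_acceleration_disk(surface_density_msun_pc2, a0=A0):
    """Crude classification: True if the disk lies below Sigma_dagger."""
    return surface_density_msun_pc2 < critical_surface_density(a0)


if __name__ == "__main__":
    print(f"Sigma_dagger = {critical_surface_density():.0f} Msun/pc^2")
    print("LSB disk, 10 Msun/pc^2 in low-acceleration regime:",
          is_low_acceleration_disk(10.0))
```

With the assumed constants the threshold comes out at several hundred solar masses per square parsec, so typical low surface brightness disks sit far below it.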

Interacting galaxies are by definition non-stationary systems in which the customary assumption 
of equilibrium does not generally hold. This renders direct tests of MOND difficult. However, it is 
worth investigating whether commonly observed morphologies (e.g., tidal tails) are even possible 
in MOND. Initially, this seemed to pose a fundamental difficulty [279], as dark matter halos play 
a critical role in absorbing the orbital energy and angular momentum that it is necessary to shed 
if passing galaxies are to not only collide, but stick and merge. Nevertheless, recent numerical 
simulations appear to do a nice job of reproducing observed morphologies [460]. This is no trivial 
feat. While it is well established that dark matter models can result in nice tidal tails, it turns 
out to be difficult to simultaneously match the narrow morphology of many observed tidal tails 
with rotation curves of the systems from which they come [131] . Narrow tidal tails appear to be 
natural in MOND, as well as more extended resulting galaxies, thanks to the absence of angular 
momentum transfer to the dark halo (solution to the challenge n°7 of Sect. 4.2). Additionally, tidal 
dwarfs that form in these tails clearly have characteristics closer to those observed (see Sect. 6.5.4) 
than those from dark matter simulations [166, 310]. 

Spheroidal systems also provide tests of MOND (Sect. 6.6). Unlike the case of disk galaxies, 
where orbits are coplanar and nearly circular so that the centripetal acceleration can be equated 
with the gravitational force, the orbits in spheroidal systems are generally eccentric and randomly 
oriented. This introduces an unknown geometrical factor usually subsumed into a parameter that 
characterizes the anisotropy of the orbits. Accepting this, MOND appears to perform well in the 
classical dwarf spheroidal galaxies, but implies that the ultrafaint dwarfs are out of equilibrium 
(see Sect. 6.6.2). For small systems like the ultrafaint dwarfs and star clusters (Sect. 6.6.3) within 
the Milky Way, the external field effect (Sect. 6.3) can be quite important. This means that 
star clusters generally exhibit Newtonian behavior by virtue of being embedded in the larger 
Galaxy. Deviations from purely Newtonian behavior are predicted to be subtle and are fodder for 
considerable debate [200, 397], rendering the present status unclear (Table [2]). At the opposite 
extreme of giant elliptical galaxies (Sect. 6.6.1), the data accord well with MOND [324] . Indeed, 
bright elliptical galaxies are sufficiently dense that their inner regions are well into the Newtonian 
regime. In the MONDian context, this is the reason that it has historically been difficult to find 
clear evidence for mass discrepancies in these systems. The apparent need for dark matter does 
not occur until radii where the accelerations become low. That only spheroidal stellar systems 
appear to exist at surface densities in excess of $\Sigma_\dagger$ is the corollary of Freeman's limit: such dense 
systems could not exist as stable disks, so must perforce become elliptical galaxies, regardless of 
the formation mechanism that made them so dense. That populations of elliptical galaxies should 
obey the Faber-Jackson relation (Kepler-like law n°3 in Sect. 5.2, Figure [7]) is also very natural to 
MOND [383, 395]. 

The largest gravitationally bound systems are also spheroidal systems: rich clusters of galaxies. 
The situation here is quite problematic for MOND (Sect. 6.6.4). The dynamical mass inferred by 
applying MOND routinely exceeds the observed baryonic mass by a factor of 2 to 3. In effect, 
MOND requires additional dark matter in galaxy clusters. The need to invoke unseen mass is 
most unpleasant for a theory that otherwise appears to be a viable alternative to the existence of 
unseen mass. However, one should remember that the present-day motivation for studying MOND 
is driven by the observed phenomenology on galaxy scales, summarized above, and not by an 
aesthetic wish of getting rid of DM. What is more, parent relativistic theories of MOND might 
well involve additional degrees of freedom in the form of "dark fields". But in any case, one must 
be careful not to conflate the rather limited missing mass problem that MOND suffers in clusters 
with the non-baryonic collisionless cold dark matter required by cosmology. There is really nothing 
about the cluster data that requires the excess mass to be non-baryonic, as long as it behaves in a 
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collisionless way. There could for instance be baryonic mass in some compact non-luminous form 
(see Sect. 6.6.4 for an extensive discussion). This might seem to us unlikely, but it does have 
historical precedent. When Zwicky [519] first identified the dark matter problem in clusters, the 
mass discrepancy was of order ~ 100. That is, unseen mass outweighed the visible stars by two 
orders of magnitude. It was only decades later that it was recognized that baryons residing in a hot 
intracluster gas greatly outweighed those in stars. In effect, there were [at least] two missing mass 
problems in clusters. One was the hot gas, which reduces the conventional discrepancy from a factor 
of ≈ 100 to a factor of ≈ 8 [176] in Newtonian gravity. From this perspective, the remaining factor 
of two in MOND seems modest. Rich clusters of galaxies are rare objects, so the total required 
mass density can readily be accommodated within the baryon budget of big bang nucleosynthesis. 
Indeed, according to BBN, there must still be a lot of unidentified baryons lurking somewhere in 
the Universe. But the excess dark mass in clusters need not be baryonic, even in MOND. Massive 
ordinary neutrinos [389, 392] and light sterile neutrinos [S] [13] have been suggested as possible 
forms of dark matter that might provide an explanation for the missing mass in clusters. Both are 
non-baryonic, but as they are hot DM particle candidates, neither can constitute the cosmological 
non-baryonic cold dark matter. At this juncture, all we can say for certain is that we do not know 
what the composition of the unseen mass is. It could even just be evidence for the effect of 
additional "dark fields" in the parent relativistic formulation of MOND, such as massive scalar 
fields, vector fields, dipolar dark matter, or even subtle non-local effects (see Sect. 7). 

There are other aspects of cluster observations that are more in line with MOND's predictions. 
Clusters obey a mass-temperature relation that parallels the M (x (x prediction of MOND 
(Figures [39] and |48]) more closely than the conventional prediction of M c>c T'^/^ expectation in 
ACDM, without the need to invoke preheating (a need that may arise as an artifact of the mismatch 
in slopes). Indeed, Figure [35] shows clearly both the failing of MOND in the offset in characteristic 
acceleration between clusters and lower mass systems, and its successful prediction of the slope 
(a horizontal line in this figure). A further test which may be important is the peculiar and bulk 
velocity of clusters. For example, the collision velocity of the bullet cluster is so large as to be
highly improbable in ΛCDM (occurring with a probability of ~ $10^{-10}$ [250]). In contrast, large
collision velocities are natural to MOND [16]. Similarly, the large scale peculiar velocity of clusters
is observed to be ~ 1000 km/s [222], well in excess of the expected ~ 200 km/s. Ongoing
simulations with MOND 11 show some promise to produce large peculiar velocities for clusters. 
In general, one would expect high speed collisions to be more ubiquitous in MOND than in ΛCDM.
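
The two mass-temperature slopes compared above can be recovered from a short scaling argument. The following is a sketch only, assuming an isolated system in equilibrium in the deep-MOND regime and isothermal gas whose temperature tracks the velocity dispersion; the symbols $k_B$, $\mu$, $m_p$, $R$ and $\bar{\rho}$ (Boltzmann constant, mean molecular weight, proton mass, system radius, mean density) are introduced here for illustration and do not appear in the text.

```latex
\begin{align}
  % MOND: velocity dispersion set by the mass alone, with no dependence on size
  \sigma^{4} &\propto G M a_0 , &
  k_B T &\propto \mu m_p \sigma^{2}
  &&\Longrightarrow &
  M &\propto \frac{\sigma^{4}}{G a_0} \propto T^{2} , \\
  % Conventional: Newtonian virial equilibrium at fixed mean density
  k_B T &\propto \frac{G M \mu m_p}{R} , &
  M &\propto \bar{\rho} R^{3}
  &&\Longrightarrow &
  T &\propto M^{2/3} , \quad M \propto T^{3/2} .
\end{align}
```

In the conventional case the radius enters through the assumed fixed mean density, whereas in the MOND case the size drops out entirely, which is why the two slopes differ.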

An important line of evidence for mass discrepancies in the Universe is gravitational lensing 
in excess of that expected from the observed mass of lens systems. Lensing is an intrinsically 
relativistic effect that requires a generally covariant theory to properly address. This necessarily 
goes beyond MOND itself into specific hypotheses for its parent theory (Sect. 7), so is somewhat 
different than the tests discussed above. Broadly speaking, tests involving strong gravitational 
lensing fare tolerably well (Sect. 8.1), whereas weak lensing tests, which are sensitive to larger scale
mass distributions, are more problematic (Sects. 8.2, 8.3, and 8.4) or simply crash into the usual 
missing mass problem of MOND in clusters. Note that weak lensing in relativistic MOND theories 
produces the same amount of lensing as required from dynamics, so this is not the problem. The 
problematic fact is just that some tests seem to require more dark matter than the effect of MOND 
provides. 

On larger (cosmological) scales, MOND, as a modification of classical (non-covariant) dynamics, 
is simply unsatisfactory or mute. MOND itself has no cosmology, providing analogs for neither 
the Friedmann equation for the dynamics of the Universe, nor the Robertson-Walker metric for
its geometry. For these, one must appeal to specific hypotheses for the relativistic parent theory 
of MOND (Sect. 7), which is far from unique, and theoretically not really satisfactory, as none of 

The observed shock velocity of ~ 4700 km/s is thought to be enhanced by hydrodynamical effects. The collision
velocity is improbable after a substantial (~ 1700 km/s) correction for this [259].

the present candidates emerges from first principles. At this juncture, it is not clear whether a
compelling candidate cosmology will ever emerge. But on the other hand, there is nothing about 
MOND as a paradigm that contradicts per se the empirical pillars of the hot big bang: Hubble 
expansion, big bang nucleosynthesis, and the relic radiation field (Sect. 9). The formation of large 
scale structure is one of the strengths of conventional theory, which can be approached with linear 
perturbation theory. This leads to good fits of the power spectrum both at early times ($z \approx 1000$
in the cosmic microwave background) and at late times (the $z = 0$ galaxy power spectrum [453]).
In contrast, the formation of structure in MOND is intrinsically non-linear. It is therefore unclear 
whether MOND-motivated relativistic theories will inevitably match the observed galaxy power 
spectrum, a possible problem being how to damp the baryon acoustic oscillations [128, 431]. At
this stage, a unique prediction does not exist. Nevertheless, there are two aspects of structure 
formation in MOND that appear to be fairly generic and distinct from ΛCDM. The stronger
effective long range force in MOND speeds the growth rate, but has less mass to operate with as 
a source. Consequently, radiation domination persists longer and structure formation is initially 
inhibited (at redshifts of hundreds). Once structure begins to form, the non-linearity of MOND 
causes it to proceed more rapidly than in GR with CDM. Three observable consequences would be 
(i) the earlier emergence of large objects like galaxies and clusters in the cosmic web (as well as the 
associated low interaction rate at smaller redshifts) providing a possible solution to challenge n°2 
of Sect. 4.2 [17], (ii) the more efficient evacuation of large voids (possible solution to challenge n°3
of Sect. 4.2), and (iii) larger peculiar (and collisional [16]) velocities of galaxy clusters (solution
to challenge n°1 of Sect. 4.2). The potential downside to rapid structure formation in MOND,
however, is that it may overproduce structure by redshift zero [342, 251].

The final entries in Table [2] regard the cosmic microwave background, discussed in more detail in 
Sect. 9.2. The third peak of the acoustic power spectrum of the CMB poses perhaps the most severe
challenge to a MONDian interpretation of cosmology. The amplitude of the third peak measured 
by WMAP is larger than expected in a Universe composed solely of baryons [443]. This implies
some substance that does not oscillate with the baryons. Cold dark matter fits this bill nicely. 
In the context of MOND, we must invoke some other massive substance (i.e., non-baryonic dark 
matter such as, e.g., light sterile neutrinos [2]) that plays the role of CDM, or rely on additional
degrees of freedom in the relativistic parent theory of MOND (see Sect. 7) that would have the 
same net result (see the extensive discussion in Sect. 9.2), or even combine non-baryonic dark 
matter with these additional degrees of freedom [431]. While these are real possibilities, none
is a priori particularly appealing, any more than it is to invoke CDM with complex fine-tuned
feedback to explain rotation curves that apparently require only baryons as a source. 

The missing baryon problem that MOND suffers in rich clusters of galaxies and the third peak 
of the acoustic power spectrum of the CMB are thus the most serious challenges presently facing 
MOND. But even so, the interpretation of the acoustic power spectrum is not entirely clear cut. 
Though there is no detailed fit to the power spectrum in MOND (unless we invoke 10 eV-scale
sterile neutrinos [9]), MOND did motivate the a priori prediction [266] of two aspects of the CMB
that were surprising in ΛCDM (see Sect. 9.2). The amplitude ratio of the first-to-second peak in
the acoustic power spectrum was outside the bounds expected ahead of time by ΛCDM for $\Omega_b$
from big bang nucleosynthesis as it was then known (see Sect. 9.2). In contrast, the first:second
acoustic peak ratio that is now well measured agrees well with the quantitative value predicted in 
advance for the case of the absence of cold dark matter [269, 270]. Similarly, the rapid formation
of structure expected in MOND leads naturally to an earlier epoch of re-ionization than had been 
anticipated in ΛCDM [266, 270]. Thus, while the amplitude of the third peak is clearly problematic
and poses a severe challenge for any MOND-inspired theories, the overall interpretation of the CMB 
is debatable. While the existence of non-baryonic cold dark matter is indeed a priori the most obvious
explanation of the third peak, it is not at all obvious that straightforward CDM - in the
form of rather simple massive inert collisionless particles - is uniquely required. 

Science is in principle about theories or models that are falsifiable, and thus that are presently
either falsified or not. But in practice it does not (and cannot) really work that way: if a model that 
was making good predictions up to a certain point suddenly does not work anymore (i.e., does not 
fit some new data), one obviously first tries to adjust it to make it fit the observations rather than 
throwing it away immediately. This is what one calls the requisite "compensatory adjustments" 
of the theory (or of the model): Popper himself drew attention to these limitations of falsification
in The Logic of Scientific Discovery [356]. In the case of the ΛCDM model of cosmology, which
is mostly valid on large scales, the current main trend is to find the "compensatory adjustments" 
to the model to make it fit in galaxies, mainly by changing (or mixing) the mass(es) of the dark 
matter particles, and/or through artificially fine-tuned baryonic feedback in order to reproduce the 
success of MOND. Incidentally, exactly the same is true for MOND, but for the opposite scales: 
MOND works remarkably well in galaxies but apparently needs compensatory adjustments on 
larger scales to effectively replace CDM. Now does that mean that falsification is impossible? That 
all models are equal? Surely not. In the end, a theory or a model is really falsified once there are 
too many compensatory adjustments (needed in order to fit too many discrepant data), or once 
these become too twisted (like Tycho Brahe's geocentric model for the Solar system). But there is 
obviously no truly quantitative way of ascertaining such global falsification. How one chooses to 
weigh the evidence presented in this review necessarily informs one's opinion of the relative merits 
of ΛCDM and MOND. If one is most familiar with cosmology and large scale structure, ΛCDM
is the obvious choice, and it must seem rather odd that anyone would consider an alternative 
as peculiar as MOND, needing rather bizarre adjustments to match observations on large scales. 
But if one is more concerned with precision dynamics and the observed phenomenology in a wide 
swath of galaxy data, it seems just as strange to invoke non-baryonic cold dark matter together 
with fine-tuned feedback to explain the appearance of a single effective force law that appears to act 
with only the observed baryons as a source. Perhaps the most important aspect before one throws 
away any model is to have a "simpler" model at hand, that still reproduces the successes of the 
earlier favored model but also naturally explains the discrepant data. In that sense, right now, it is 
absolutely fair to say that there is no alternative that really does better overall than ΛCDM, and
that Ockham's razor would favor. It would however probably be a mistake to persistently
ignore the fine-tuning problems for dark matter and the related uncanny successes of the MOND 
paradigm on galaxy scales, as they could very plausibly point at a hypothetical better new theory. 
It is also important to bear in mind that MOND, as a paradigm or as a modification of Newtonian 
dynamics, is not itself generally covariant. Attempts to construct relativistic theories that contain 
MOND in the appropriate limit (Sect. 7) are correlated but distinct efforts, and one must be careful 
not to conflate the two. For example, some theories, like TeVeS (Sect. 7.4), might make predictions 
that are distinct from GR in the strong-field regime. Should future tests falsify these distinctive
predictions of TeVeS while confirming those of GR, this would perhaps falsify TeVeS as a viable 
parent theory for MOND, but would have no bearing on the MONDian phenomenology observed 
in the weak-field regime, nor indeed on the viability of MOND itself. It would perhaps simply 
indicate the need to continue to search for a deeper theory. It would for instance be extremely 
alluring if one would manage to find a physical connection between the dark energy sector and 
the possible breakdown of standard dynamics in the weak-field limit, since both phenomena would
then simply reflect discrepancies with the predictions of GR when $\Lambda \sim a_0^{2}$ is set to zero (see,
e.g., Sect. 7.10). It is of course perfectly conceivable that such a deep theory does not exist, and
that the apparent MONDian behavior of galaxies will be explained through small compensatory 
adjustments of the current ΛCDM paradigm, but one has yet to demonstrate how this will occur,
and it will inevitably involve a substantial amount of fine-tuning that will have to be explained 
naturally. In any case, the existence of a characteristic acceleration $a_0$ (Figure [48]) playing various
different roles in many seemingly independent galactic scaling relations (see Sects. 4.3 and 5.2) is 
by now an empirically established fact, and it is thus mandatory for any successful model of galaxy 
formation and evolution to explain it. The future of this field of research might thus still be full
of exciting surprises for astronomers, cosmologists, and theoretical physicists. 
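
The numerical coincidence behind the remark that $\Lambda \sim a_0^{2}$ in natural units can be checked with a few lines of arithmetic. The sketch below is illustrative only: the input values ($a_0 = 1.2 \times 10^{-10}$ m/s², $H_0 = 70$ km/s/Mpc, $\Omega_\Lambda = 0.7$) are assumed round numbers that are not quoted in this section, and the function names are hypothetical.

```python
import math

C = 2.99792458e8       # speed of light in m/s (exact SI value)
MPC_IN_M = 3.0857e22   # metres per megaparsec (rounded)


def hubble_si(h0_km_s_mpc):
    """Convert a Hubble constant from km/s/Mpc to 1/s."""
    return h0_km_s_mpc * 1.0e3 / MPC_IN_M


def cosmological_constant(h0_km_s_mpc, omega_lambda):
    """Lambda = 3 * Omega_Lambda * H0^2 / c^2, in 1/m^2."""
    return 3.0 * omega_lambda * hubble_si(h0_km_s_mpc) ** 2 / C ** 2


def coincidence_ratios(a0, h0_km_s_mpc, omega_lambda):
    """Dimensionless ratios comparing a0 with cosmological acceleration scales."""
    h0 = hubble_si(h0_km_s_mpc)
    lam = cosmological_constant(h0_km_s_mpc, omega_lambda)
    return {
        "2*pi*a0 / (c*H0)": 2.0 * math.pi * a0 / (C * h0),
        "2*pi*a0 / (c^2*sqrt(Lambda/3))":
            2.0 * math.pi * a0 / (C ** 2 * math.sqrt(lam / 3.0)),
        "a0^2 / (c^4*Lambda)": a0 ** 2 / (C ** 4 * lam),
    }


if __name__ == "__main__":
    # Assumed illustrative inputs, not values taken from this section.
    for label, value in coincidence_ratios(1.2e-10, 70.0, 0.7).items():
        print(f"{label}: {value:.3g}")
```

With these assumed inputs the first two ratios come out of order unity, while the third is smaller by roughly two orders of magnitude, so the relation holds only up to numerical factors such as $2\pi$ and 3. Nothing in this check explains the coincidence; it only shows that the acceleration scales are comparable.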

Table 2: Observational tests of MOND. (X marks the assessment of each test; - marks an empty cell.)

Observational Test                                    Successful  Promising  Unclear  Problematic

Rotating Systems
  solar system                                            -           -         X          -
  galaxy rotation curve shapes                            X           -         -          -
  surface brightness $\propto \Sigma \propto a^{2}$       X           -         -          -
  galaxy rotation curve fits                              X           -         -          -
  fitted M*/L                                             X           -         -          -

Tully-Fisher Relation
  baryon based                                            X           -         -          -
  slope                                                   X           -         -          -
  normalization                                           X           -         -          -
  no size nor $\Sigma$ dependence                         X           -         -          -
  no intrinsic scatter                                    X           -         -          -

Galaxy Disk Stability
  maximum surface density                                 X           -         -          -
  spiral structure in LSBGs                               X           -         -          -
  thin & bulgeless disks                                  -           X         -          -

Interacting Galaxies
  tidal tail morphology                                   -           X         -          -
  dynamical friction                                      -           -         X          -
  tidal dwarfs                                            X           -         -          -

Spheroidal Systems
  star clusters                                           -           -         X          -
  ultrafaint dwarfs                                       -           -         X          -
  dwarf spheroidals                                       X           -         -          -
  ellipticals                                             X           -         -          -
  Faber-Jackson relation                                  X           -         -          -

Clusters of Galaxies
  dynamical mass                                          -           -         -          X
  mass-temperature slope                                  X           -         -          -
  velocity (bulk & collisional)                           -           X         -          -

Gravitational Lensing
  strong lensing                                          X           -         -          -
  weak lensing (clusters & LSS)                           -           -         X          -

Cosmology
  expansion history                                       -           -         X          -
  geometry                                                -           -         X          -
  big bang nucleosynthesis                                X           -         -          -

Structure Formation
  galaxy power spectrum                                   -           -         X          -
  empty voids                                             -           X         -          -
  early structure                                         -           X         -          -

Background Radiation
  first:second acoustic peak                              X           -         -          -
  second:third acoustic peak                              -           -         -          X
  detailed fit                                            -           -         -          X
  early re-ionization                                     X           -         -          -